hardmaru · Mar 25, 2026 · 4:24 PM UTC

hardmaru

Pinned Tweet

hardmaru

@hardmaru

Mar 25

I’m incredibly proud of The AI Scientist team for this milestone publication in @Nature. We started this project to explore if foundation models could execute the entire research lifecycle. Seeing this work validated at this level is a special moment. I truly believe AI will forever change the landscape of how scientific discoveries and scientific progress are made.

Sakana AI

@SakanaAILabs

Mar 25

The AI Scientist: Towards Fully Automated AI Research, Now Published in Nature

Nature: nature.com/articles/s41586-0…

Blog: sakana.ai/ai-scientist-natur…

When we first introduced The AI Scientist, we shared an ambitious vision of an agent powered by foundation models capable of executing the entire machine learning research lifecycle. From inventing ideas and writing code to executing experiments and drafting the manuscript, the system demonstrated that end-to-end automation of the scientific process is possible.

Soon after, we shared a historic update: the improved AI Scientist-v2 produced the first fully AI-generated paper to pass a rigorous human peer-review process.

Today, we are happy to announce that “The AI Scientist: Towards Fully Automated AI Research,” our paper describing all of this work, along with fresh new insights, has been published in @Nature!

This Nature publication consolidates these milestones and details the underlying foundation model orchestration. It also introduces our Automated Reviewer, which matches human review judgments and actually exceeds standard inter-human agreement.

Crucially, by using this reviewer to grade papers generated by different foundation models, we discovered a clear scaling law of science. As the underlying foundation models improve, the quality of the generated scientific papers increases correspondingly. This implies that as compute costs decrease and model capabilities continue to exponentially increase, future versions of The AI Scientist will be substantially more capable.

Building upon our previous open-source releases (github.com/SakanaAI/AI-Scien…), this open-access Nature publication comprehensively details our system's architecture, outlines several new scaling results, and discusses the promise and challenges of AI-generated science.

This substantial milestone is the result of a close and fruitful collaboration between researchers at Sakana AI, the University of British Columbia (UBC) and the Vector Institute, and the University of Oxford. Congrats to the team! @_chris_lu_ @cong_ml @RobertTLange @_yutaroyamada @shengranhu @j_foerst @hardmaru @jeffclune

158

1,230

275,489

hardmaru · Jul 24, 2023 · 11:34 AM UTC

hardmaru

@hardmaru

24 Jul 2023

This design is better than “𝕏”.

Hiroon @amirHiroon

23 Jul 2023

🐦 X @elonmusk

198

9,432

93,223

4,979,624

hardmaru · Oct 1, 2021 · 12:48 AM UTC

hardmaru

@hardmaru

1 Oct 2021

The Mysteries of the Universe

119

18,635

82,365

hardmaru · Oct 5, 2020 · 3:44 AM UTC

hardmaru

@hardmaru

5 Oct 2020

Nothing against @JeffBezos but this is the stuff of evil genius villains 🙃

Wevolver

1,433

10,655

68,041

hardmaru · Apr 15, 2022 · 4:41 PM UTC

hardmaru

@hardmaru

15 Apr 2022

Trolley problem solved:

B作

271

11,199

57,294

hardmaru · Nov 28, 2019 · 7:04 AM UTC

hardmaru

@hardmaru

28 Nov 2019

Gradient descent is used in many ways at Tesla

175

7,410

50,120

hardmaru · Jun 28, 2020 · 2:10 PM UTC

hardmaru

@hardmaru

28 Jun 2020

drone pilots

Barstool Sports

161

1,913

8,646

hardmaru · Feb 23, 2020 · 7:44 AM UTC

hardmaru

@hardmaru

23 Feb 2020

I just ordered this book for my kids.

122

1,838

8,903

hardmaru · Apr 6, 2025 · 1:28 AM UTC

hardmaru

@hardmaru

6 Apr 2025

😅

131

8,436

316,992

hardmaru · May 12, 2023 · 3:03 AM UTC

hardmaru

@hardmaru

12 May 2023

AI Twitter these days. 👇🧵

100

1,112

7,952

797,990

hardmaru · Jan 11, 2023 · 8:10 AM UTC

hardmaru

@hardmaru

11 Jan 2023

The opening line of David Goodstein’s textbook, “States of Matter” 🤯

This Post is from an account that no longer exists.

1,118

7,338

830,851

hardmaru · Apr 23, 2023 · 5:48 AM UTC

hardmaru

@hardmaru

23 Apr 2023

Deploying large language models to production:

703

5,244

398,794

hardmaru · Dec 22, 2024 · 1:28 AM UTC

hardmaru

@hardmaru

22 Dec 2024

AI generated videos out of control 😹

Ring Hyacinth

604

5,477

387,688

hardmaru · Oct 28, 2021 · 5:14 PM UTC

hardmaru

@hardmaru

28 Oct 2021

An auto-encoder with a very strong inductive bias. nitter.app/Damnlnteresting/status…

1,015

4,277

hardmaru · Apr 29, 2023 · 2:43 PM UTC

hardmaru

@hardmaru

29 Apr 2023

Pushing around these little robot soccer players, from DeepMind’s “Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning” paper. arxiv.org/abs/2304.13653 sites.google.com/view/op3-so…

272

665

4,119

1,731,818

hardmaru · Apr 22, 2019 · 1:09 AM UTC

hardmaru

@hardmaru

22 Apr 2019

We're living in a cyberpunk future: “Fooling automated surveillance cameras: adversarial patches to attack person detection” arxiv.org/abs/1904.08653

1,595

3,644

hardmaru · Jun 6, 2023 · 11:24 AM UTC

hardmaru

@hardmaru

6 Jun 2023

QR codes created using Stable Diffusion and ControlNet. This is Art.

442

3,579

534,072

hardmaru · Jan 6, 2023 · 5:43 AM UTC

hardmaru

@hardmaru

6 Jan 2023

A #StableDiffusion model trained on images of Japanese Kanji characters came up with “Fake Kanji” for novel concepts like Skyscraper, Pikachu, Elon Musk, Deep Learning, YouTube, Gundam, Singularity, etc. They kind of make sense. Not bad!

enpitsu @enpitsu

4 Jan 2023

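I trained a drawing AI (Stable Diffusion) on 10,000 kanji and their meanings and had it do kakizome (the first calligraphy of the New Year). In order, the characters are 「謹」, 「賀」, 「新」, 「年」.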

104

723

3,549

1,266,845

hardmaru · Mar 1, 2023 · 4:47 AM UTC

hardmaru

@hardmaru

1 Mar 2023

This is what life is like at a Generative AI startup.

377

3,446

347,163

hardmaru · Dec 27, 2020 · 3:09 PM UTC

hardmaru

@hardmaru

27 Dec 2020

Interesting physics analogy of ML from the viewpoint of compression (@elonmusk) “Physics formulas are compression algorithms for reality…If you ran physics simulation of the universe, eventually you will have sentience…At what point from hydrogen to us did it become sentient?”

131

534

3,236

hardmaru · Jan 24, 2025 · 7:55 AM UTC

hardmaru

@hardmaru

24 Jan 2025

DeepSeek is a side project 🔥

anton

@abacaj

22 Jan 2025

How is Deepseek going to make money?

314

3,086

350,031

hardmaru · Jan 30, 2019 · 7:17 AM UTC

hardmaru

@hardmaru

30 Jan 2019

A fun way to learn about neural networks and AI is to implement a simulation game giving your agents little neural net brains, and training them using a simple method like evolution. This demo trains a small neural network to drive around the track after only a few generations:

1,097

3,074
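The idea in the Jan 30, 2019 post above (give agents small neural net brains and train them with a simple evolutionary method) can be sketched roughly as follows. This is only a minimal illustration and not the code behind the demo: the "driving" task is a toy stand-in (steer back toward the lane centre given a lateral offset), and the names `policy`, `fitness`, and `evolve`, the network size, and all hyperparameters are assumptions made for the example.

```python
import math
import random

HIDDEN = 4
N_PARAMS = 3 * HIDDEN  # input weights, hidden biases, output weights


def policy(params, offset):
    """Tiny one-hidden-layer network: lateral offset in, steering value out."""
    w1 = params[:HIDDEN]
    b1 = params[HIDDEN:2 * HIDDEN]
    w2 = params[2 * HIDDEN:]
    hidden = [math.tanh(w * offset + b) for w, b in zip(w1, b1)]
    return math.tanh(sum(w * h for w, h in zip(w2, hidden)))


def fitness(params):
    """Toy stand-in for a driving score (an assumption): the ideal steering
    is the negative of the lateral offset, so squared error is penalised."""
    offsets = [i / 10.0 for i in range(-10, 11)]
    error = sum((policy(params, x) + x) ** 2 for x in offsets) / len(offsets)
    return -error  # higher is better


def evolve(generations=30, pop_size=32, sigma=0.3, seed=0):
    """Simple elitist evolution: keep the top quarter, mutate to refill."""
    rng = random.Random(seed)
    population = [[rng.gauss(0.0, 1.0) for _ in range(N_PARAMS)]
                  for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        elite = ranked[: pop_size // 4]
        history.append(fitness(elite[0]))
        # Elite members survive unchanged; the rest are mutated copies of them.
        children = [[w + rng.gauss(0.0, sigma) for w in rng.choice(elite)]
                    for _ in range(pop_size - len(elite))]
        population = elite + children
    return elite[0], history


if __name__ == "__main__":
    best, history = evolve()
    print("first generation best:", history[0])
    print("last generation best:", history[-1])
```

Because the elite are carried over unchanged and the fitness function is deterministic, the best score in `history` never gets worse from one generation to the next.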

hardmaru · Feb 10, 2021 · 2:08 AM UTC

hardmaru

@hardmaru

10 Feb 2021

The most important formula in deep learning after 2018

495

3,026

hardmaru · May 27, 2023 · 8:46 AM UTC

hardmaru

@hardmaru

27 May 2023

I prefer: MANGA 💕💖✨

ALT Meta, Apple, NVIDIA, Google, Amazon

Yann LeCun

@ylecun

27 May 2023

I propose a new acronym for the AI-ensconced red-hot tech giants: MAGMA Meta, Amazon, Google, Microsoft, Apple.

536

3,011

551,522

hardmaru · May 12, 2025 · 2:16 AM UTC

hardmaru

@hardmaru

12 May 2025

New Paper: Continuous Thought Machines 🧠

Neurons in brains use timing and synchronization in the way that they compute, but this is largely ignored in modern neural nets. We believe neural timing is key for the flexibility and adaptability of biological intelligence.

We propose a new neural architecture, “Continuous Thought Machines” (CTMs), which is built from the ground up to use neural dynamics as a core representation for intelligence. By using neural dynamics as a first-class representational citizen, CTMs naturally perform adaptive computation.

Many emergent, interesting behaviors arise as a result:

CTMs solve mazes by observing a raw maze image and producing step-by-step instructions directly from its neural dynamics.

When tasked with image recognition, the CTM naturally takes multiple steps to examine different parts of the image before making its decision. This step-by-step approach not only makes its behavior more interpretable but also improves accuracy: the longer it “thinks,” the more accurate its answers become. We also found that this allows the CTM to decide to spend less time thinking on simpler images, thus saving energy.

When identifying a gorilla, for example, the CTM’s attention moves from eyes to nose to mouth in a pattern remarkably similar to human visual attention.

I think this work underscores an important, yet often lost, synergy between neuroscience and AI. While modern AI is ostensibly brain-inspired, the two fields often operate in surprising isolation. By starting with such inspiration and iteratively following the emergent, interesting behaviors, we developed a model with unexpected capabilities, such as its surprisingly strong calibration in classification tasks, a feature that was not explicitly designed for.

When we initially asked, “why do this research?”, we hoped the journey of the CTM would provide compelling answers. By embracing light biological inspiration and pursuing the novel behaviors observed, we have arrived at a model with emergent capabilities that exceeded our initial designs. We are committed to continuing this exploration, borrowing further concepts to discover what new and exciting behaviors will emerge, pushing the boundaries of what AI can achieve.

549

3,160

257,281

hardmaru · Oct 24, 2020 · 11:35 AM UTC

hardmaru

@hardmaru

24 Oct 2020

One of the most well-known pieces of software for downloading YouTube videos, “youtube-dl” was removed from GitHub following a takedown notice from the Recording Industry Association of America, or RIAA. Someone encoded the source code into two images and put it on Twitter:

You’re unable to view this Post because this account owner limits who can view their Posts.

890

2,863

hardmaru · Jul 13, 2025 · 1:18 PM UTC

hardmaru

@hardmaru

13 Jul 2025

Google’s Gemini 2.5 paper has 3295 authors arxiv.org/abs/2507.06261

hardmaru

@hardmaru

21 Dec 2023

Google’s Gemini paper has ~1000 authors arxiv.org/abs/2312.11805

427

2,994

1,204,866

hardmaru · Jun 2, 2025 · 2:25 AM UTC

hardmaru

@hardmaru

2 Jun 2025

Facebook AI Research is the OG “Open” AI

251

2,939

352,958

hardmaru · Apr 14, 2024 · 6:08 AM UTC

hardmaru

@hardmaru

14 Apr 2024

Announcing NeurIPS Preschool Track This year, we invite preschoolers to submit machine learning research papers.

378

2,833

287,190

hardmaru · Mar 21, 2019 · 6:57 AM UTC

hardmaru

@hardmaru

21 Mar 2019

This person took ~1700 pages of notes in mathematics lectures using LaTeX and Vim, and documented the workflow: castel.dev/post/lecture-note…

872

2,757

hardmaru · Aug 17, 2023 · 1:44 PM UTC

hardmaru

@hardmaru

17 Aug 2023

Personal Announcement! I’m launching @SakanaAILabs together with my friend, Llion Jones (@YesThisIsLion). sakana.ai is a new R&D-focused company based in Tokyo, Japan. We’re on a quest to create a new kind of foundation model based on nature-inspired intelligence!

ALT https://sakana.ai/

141

398

2,742

573,250

hardmaru · May 9, 2020 · 4:28 AM UTC

hardmaru

@hardmaru

9 May 2020

data preprocessing

368

2,647

hardmaru · Oct 22, 2023 · 9:10 AM UTC

hardmaru

@hardmaru

22 Oct 2023

Artificial lifeforms are super fascinating to watch. These self-organizing, self-replicating, “lifeforms” emerged from a continuous time cellular automata system called Flow-Lenia. Lenia is a family of CAs generalizing Conway’s Game of Life to continuous space, time and states.

486

2,542

568,538
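For context on the Oct 22, 2023 post above: the discrete rule that Lenia generalizes is Conway's Game of Life. The sketch below implements only that classic discrete rule, not Lenia or Flow-Lenia, and the function name `life_step` and the wrap-around grid are choices made for this illustration.

```python
def life_step(grid):
    """One Game of Life update on a wrap-around (toroidal) grid of 0/1 cells."""
    rows, cols = len(grid), len(grid[0])
    new_grid = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            # Count the eight neighbours, wrapping at the edges.
            neighbours = sum(
                grid[(r + dr) % rows][(c + dc) % cols]
                for dr in (-1, 0, 1)
                for dc in (-1, 0, 1)
                if (dr, dc) != (0, 0)
            )
            alive = grid[r][c] == 1
            # Birth with exactly 3 neighbours; survival with 2 or 3.
            if neighbours == 3 or (alive and neighbours == 2):
                new_grid[r][c] = 1
    return new_grid


if __name__ == "__main__":
    # A "blinker": three cells in a row that flip between horizontal and vertical.
    blinker = [[0] * 5 for _ in range(5)]
    blinker[2][1] = blinker[2][2] = blinker[2][3] = 1
    for row in life_step(blinker):
        print("".join("#" if cell else "." for cell in row))
```

Lenia replaces the 0/1 states, the fixed eight-cell neighbourhood, and the unit time step with continuous counterparts, as the post describes.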

hardmaru · Sep 30, 2022 · 10:13 PM UTC

hardmaru

@hardmaru

30 Sep 2022

Some personal news: After six years at Google, I decided it was time for me to leave and try something new again. I had a fantastic time at Google Brain, and I’ll miss my friends, collaborators, and hanging out at the microkitchens!

129

2,585

hardmaru · Dec 30, 2022 · 12:55 PM UTC

hardmaru

@hardmaru

30 Dec 2022

"Some rich people lost all their fortunes and became homeless" #StableDiffusion2 #AIart (Source: teddit.net/comments/zy9cmg)

273

2,484

413,423

hardmaru · Aug 15, 2019 · 7:45 AM UTC

hardmaru

@hardmaru

15 Aug 2019

Teams of high school students built bottle-flipping robots for RoboCon 2018 in Japan

800

2,474

hardmaru · Aug 29, 2023 · 9:00 PM UTC

hardmaru

@hardmaru

29 Aug 2023

Anti-hype LLM reading list. Pretty good list. gist.github.com/veekaybee/be…

456

2,382

228,694

hardmaru · Oct 4, 2022 · 4:50 AM UTC

hardmaru

@hardmaru

4 Oct 2022

New blog post: Collective Intelligence for Deep Learning Recently, @yujin_tang and I published a paper about how ideas like swarm behavior, self-organization, emergence are gaining traction in deep learning. I wrote a blog post summarizing the key ideas: blog.otoro.net/2022/10/01/co…

ALT Emergence of encirclement tactics in MAgent, a large scale multi-agent simulator.

448

2,322

hardmaru · Sep 25, 2019 · 10:44 PM UTC

hardmaru

@hardmaru

25 Sep 2019

Using gradient descent for everything nitter.app/danhett/status/1176116…

422

2,351

hardmaru · May 11, 2022 · 12:15 PM UTC

hardmaru

@hardmaru

11 May 2022

Interesting application combining computer vision and high-powered lasers to eradicate weeds on a farm.

Pascal Bornet

@pascal_bornet

10 May 2022

Unlike other weeding technologies, this #robot utilizes high-power lasers to eradicate weeds without disturbing the soil, and it avoids the use of herbicides! It leverages #AI to instantly identify and target weeds while rolling, day and night. By Carbon Robotics #green

374

2,144

hardmaru · Jul 7, 2020 · 3:20 AM UTC

hardmaru

@hardmaru

7 Jul 2020

Also how deep learning models are trained on a MacBook

Vivek Verma

@vcubingx

5 Jul 2020

This is how I render my animations

274

2,103

hardmaru · Jun 16, 2021 · 2:06 PM UTC

hardmaru

@hardmaru

16 Jun 2021

Proof by meme?

274

2,115

hardmaru · May 13, 2022 · 12:08 AM UTC

hardmaru

@hardmaru

13 May 2022

Asked #Dalle to generate photographs of the bear stock market in the 1930s

346

2,042

hardmaru · Jun 12, 2019 · 12:13 AM UTC

hardmaru

@hardmaru

12 Jun 2019

Weight Agnostic Neural Networks 🦎 Inspired by precocial species in biology, we set out to search for neural net architectures that can already (sort of) perform various tasks even when they use random weight values. Article: weightagnostic.github.io PDF: arxiv.org/abs/1906.04358

626

2,060

hardmaru · Aug 16, 2020 · 2:55 AM UTC

hardmaru

@hardmaru

16 Aug 2020

Roman Emperor Project Using GAN-based tools to help create photorealistic portraits of Roman Emperors from historical references Project voshart.com/ROMAN-EMPEROR-PR… Article medium.com/@voshart/photorea…

516

2,003

hardmaru · Jul 9, 2022 · 1:03 PM UTC

hardmaru

@hardmaru

9 Jul 2022

MIT offers an excellent course on Deep Learning for Art, Aesthetics, and Creativity. All of the lecture videos are available on YouTube, with a fantastic list of speakers: ali-design.github.io/deepcre…

397

1,930

hardmaru · Jun 7, 2022 · 12:38 PM UTC

hardmaru

@hardmaru

7 Jun 2022

“Oriental Painting of Sun Tzu playing a game of Warcraft II on his desktop compute” generated using #Dalle

292

1,859

hardmaru · Sep 18, 2018 · 7:53 AM UTC

hardmaru

@hardmaru

18 Sep 2018

4 hours of baby play in 2 minutes

439

1,916

hardmaru · Sep 15, 2020 · 6:42 AM UTC

hardmaru

@hardmaru

15 Sep 2020

How do you skim a research paper? I usually read (in order):
1) abstract
2) 1st paragraph of the intro
3) last paragraph of intro (for contributions)
4) 1st paragraph of the conclusion (it's usually one paragraph anyways)
5) figures / tables of results, and read their captions.

316

1,909

hardmaru · Dec 21, 2018 · 2:35 AM UTC

hardmaru

@hardmaru

21 Dec 2018

A GAN trained on accepted @CVPR papers.

583

1,937

hardmaru · Oct 27, 2017 · 8:16 PM UTC

hardmaru

@hardmaru

27 Oct 2017

I'm blown away by the method and results in this paper. Progressive growing neural nets may be a trend we will see in 2018.

775

1,860

hardmaru · Jun 3, 2020 · 3:09 PM UTC

hardmaru

@hardmaru

3 Jun 2020

Machines see objects Humans see ideology

359

1,803

hardmaru · Dec 23, 2022 · 4:55 AM UTC

hardmaru

@hardmaru

23 Dec 2022

If Google doesn’t get their act together and start shipping, they will go down in history as the company who nurtured and trained an entire generation of machine learning researchers and engineers who went on to deploy the technology at other companies… The modern day Bell Labs.

西乔 XiQiao

@recatm

23 Dec 2022

Replying to @hardmaru

What exactly is google going to do with its AI research outcome is the biggest secret in the field 🤣

173

1,809

569,487

hardmaru · Sep 22, 2022 · 1:38 PM UTC

hardmaru

@hardmaru

22 Sep 2022

"Anime scene of Yann Lecun at Bell Labs working on convolutional neural networks." #StableDiffusion

ALT Prompt: "Anime scene of Yann Lecun at Bell Labs working on convolutional neural networks, Studio Ghibli, high quality, trending on Artstation."

136

1,792

hardmaru · Mar 23, 2024 · 10:08 AM UTC

hardmaru

@hardmaru

23 Mar 2024

I love the diversity of Tokyo’s urban architecture. This Tiny House is designed by Atelier Tekuto.

ALT Image Source: https://redd.it/hab7z9

121

1,764

100,909

hardmaru · Apr 12, 2020 · 1:08 AM UTC

hardmaru

@hardmaru

12 Apr 2020

RIP, John Conway.

654

1,777

hardmaru · May 26, 2024 · 7:14 PM UTC

hardmaru

@hardmaru

26 May 2024

Infinitely recursive Game of Life. Life and civilization emerge from self-organization at different levels of complexity.

saharan / さはら

352

1,769

267,989

hardmaru · Oct 1, 2020 · 8:45 AM UTC

hardmaru

@hardmaru

1 Oct 2020

The most well-dressed snowball fight, colorized using deep learning.

You’re unable to view this Post because this account owner limits who can view their Posts.

310

1,685

hardmaru · Jun 21, 2020 · 1:55 AM UTC

hardmaru

@hardmaru

21 Jun 2020

uh oh...

372

1,735

hardmaru · Oct 6, 2020 · 7:15 AM UTC

hardmaru

@hardmaru

6 Oct 2020

Neural network video streaming SDK from @NVIDIAAI can compress video conference data like these at ~0.1KB / frame, roughly 1000x better than H.264 (MPEG-4) compression on the same data (~100KB / frame).

429

1,645
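The figures quoted in the Oct 6, 2020 post above (~0.1KB versus ~100KB per frame) can be checked with a little arithmetic. This is just a back-of-the-envelope sketch: the 30 frames per second rate and the 1 KB = 1000 bytes convention are assumptions added here, and the helper names are made up for the example.

```python
def compression_ratio(baseline_bytes_per_frame, compressed_bytes_per_frame):
    """How many times smaller the compressed frame is than the baseline frame."""
    return baseline_bytes_per_frame / compressed_bytes_per_frame


def bitrate_kbps(bytes_per_frame, frames_per_second):
    """Stream bitrate in kilobits per second (1 kbit = 1000 bits)."""
    return bytes_per_frame * 8 * frames_per_second / 1000


if __name__ == "__main__":
    h264_frame = 100_000  # ~100KB per frame, from the post (1 KB = 1000 bytes assumed)
    neural_frame = 100    # ~0.1KB per frame, from the post
    fps = 30              # assumed frame rate, not stated in the post

    print("ratio:", compression_ratio(h264_frame, neural_frame))
    print("H.264 kbit/s:", bitrate_kbps(h264_frame, fps))
    print("neural kbit/s:", bitrate_kbps(neural_frame, fps))
```

Under those assumptions the per-frame sizes give the quoted factor of roughly 1000, which corresponds to about 24 Mbit/s versus about 24 kbit/s.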

hardmaru · Apr 30, 2019 · 6:10 AM UTC

hardmaru

@hardmaru

30 Apr 2019

A series of blog posts on applying machine learning to architecture Experiments: bit.ly/2XDhjtJ Background: bit.ly/2DIeWh4

476

1,647

hardmaru · May 5, 2020 · 5:51 AM UTC

hardmaru

@hardmaru

5 May 2020

PixelMe: Convert your photo into Pixel Art pixel-me.tokyo/en/ cloud.google.com/blog/produc…

364

1,651

hardmaru · Nov 6, 2021 · 6:10 PM UTC

hardmaru

@hardmaru

6 Nov 2021

Face to Anime using AnimeGANv2 teddit.net/r/MachineLearning…

@_akhaliq

6 Nov 2021

.@Gradio Demo for AnimeGANv2 Face Portrait v2 now on @huggingface Spaces demo: huggingface.co/spaces/akhali… github: github.com/bryandlee/animega…

342

1,640

hardmaru · May 16, 2019 · 12:47 AM UTC

hardmaru

@hardmaru

16 May 2019

A Line Rider on Beethoven's 5th Symphony

544

1,617

hardmaru · Nov 26, 2018 · 1:14 PM UTC

hardmaru

@hardmaru

26 Nov 2018

This amazing book on the foundations of machine learning is now available for free from Microsoft as a PDF download. I learned so much from this book over the years, and I feel that much of the material is still relevant. The solutions to the exercises also seem to be available!

Microsoft Research @MSFTResearchCam

26 Nov 2018

"Pattern Recognition and Machine Learning" by @ChrisBishopMSFT is now available as a free download. Download your copy today for an introduction to the fields of pattern recognition & machine learning: aka.ms/prml #ML #Insights

532

1,634

hardmaru · Dec 6, 2021 · 1:33 AM UTC

hardmaru

@hardmaru

6 Dec 2021

This must be where the Bayesians meet up on the weekends

This Post is from an account that no longer exists.

195

1,596

hardmaru · Jun 23, 2018 · 5:43 PM UTC

hardmaru

@hardmaru

23 Jun 2018

Papers with Code: A searchable site that links machine learning papers on ArXiv with code on GitHub. They also tag any framework libraries used, along with other info like GitHub stars. I think such a feature would be a nice addition to ArXiv-Sanity. paperswithcode.com/

610

1,608

hardmaru · Apr 21, 2020 · 12:25 AM UTC

hardmaru

@hardmaru

21 Apr 2020

Oil should learn to code.

257

1,542

hardmaru · Apr 29, 2020 · 4:13 AM UTC

hardmaru

@hardmaru

29 Apr 2020

Fooling Facial Detection with Fashion: a nice article that surveys common face detection methods and tests practical implementations of adversarial patches on a face mask for fooling them. h/t @MelMitchell1 bit.ly/2VJbOLc github.com/BruceMacD/Adversa…

468

1,515

hardmaru · Dec 28, 2022 · 7:53 AM UTC

hardmaru

@hardmaru

28 Dec 2022

High resolution inpainting experiment with #StableDiffusion2 Transporting the famous Futaba Sushi restaurant in Ginza, Tokyo, to other cities, countries, planets, and finally, to a galaxy far far away… #StableDiffusion #AIart

279

1,556

851,619

hardmaru · Oct 31, 2020 · 12:34 PM UTC

hardmaru

@hardmaru

31 Oct 2020

A.I. camera mistakes referee’s bald head for ball, follows it through the match. teddit.net/comments/jlef67

308

1,496

hardmaru · Oct 10, 2018 · 12:21 AM UTC

hardmaru

@hardmaru

10 Oct 2018

Reinforcement Learning for Improving Agent Design: What happens when we let an agent learn a better body design together with learning its task? article: designrl.github.io/ pdf: arxiv.org/abs/1810.03779

405

1,527

hardmaru · Aug 9, 2020 · 2:16 PM UTC

hardmaru

@hardmaru

9 Aug 2020

One year after building rock solid technical infrastructure for your machine learning research project

This tweet is unavailable

193

1,483

hardmaru · May 26, 2020 · 1:31 AM UTC

hardmaru

@hardmaru

26 May 2020

“First Order Motion Model for Image Animation” hooked up to a live camera: github.com/anandpawara/Real_… Original NeurIPS2019 paper / code: aliaksandrsiarohin.github.io…

311

1,485

hardmaru · Dec 3, 2023 · 11:26 AM UTC

hardmaru

@hardmaru

3 Dec 2023

Being able to fool AI detection algorithms IRL will be an important survival skill in the 21st century 👻

Playfool @studioplayfool

2 Dec 2023

In other news, our latest work made its way onto Japanese national TV this week! 🚘

225

1,486

316,232

hardmaru · Oct 16, 2022 · 3:47 PM UTC

hardmaru

@hardmaru

16 Oct 2022

This #StableDiffusion add-on for Blender looks amazing. @AI_Render renders an AI-generated image based on a text prompt and your scene in Blender. github.com/benrugg/AI-Render

312

1,457

hardmaru · Jun 11, 2023 · 7:36 PM UTC

hardmaru

@hardmaru

11 Jun 2023

OpenAI, Google & Anthropic ban the use of the generated output content from their AI models to train other AI models, under their terms-of-service. However, they’ve been using other online content for their own model training. They can’t have it both ways. businessinsider.com/openai-g…

AI hypocrisy: OpenAI, Google and Anthropic won't let their data be used to train other AI models,...

Generative AI models are trained on a lot of online content. The companies behind these powerful models never asked for permission to do that.

businessinsider.com

325

1,441

300,414

hardmaru · Jun 13, 2023 · 4:15 PM UTC

hardmaru

@hardmaru

13 Jun 2023

Life before GPUs.

158

1,462

172,481

hardmaru · Jun 22, 2024 · 6:52 AM UTC

hardmaru

@hardmaru

22 Jun 2024

Language is primarily a tool for communication rather than thought nature.com/articles/s41586-0… “Language is a defining characteristic of our species, but the function, or functions, that it serves has been debated for centuries. Here we bring recent evidence from neuroscience and allied disciplines to argue that in modern humans, language is a tool for communication, contrary to a prominent view that we use language for thinking. We begin by introducing the brain network that supports linguistic ability in humans. We then review evidence for a double dissociation between language and thought, and discuss several properties of language that suggest that it is optimized for communication. We conclude that although the emergence of language has unquestionably transformed human culture, language does not appear to be a prerequisite for complex thought, including symbolic thought. Instead, language is a powerful tool for the transmission of cultural knowledge; it plausibly co-evolved with our thinking and reasoning capacities, and only reflects, rather than gives rise to, the signature sophistication of human cognition.”

Language is primarily a tool for communication rather than thought

Nature - Evidence from neuroscience and related fields suggests that language and thought processes operate in distinct networks in the human brain and that language is optimized for communication...

nature.com

311

1,434

644,768

hardmaru · May 22, 2023 · 3:15 AM UTC

hardmaru

@hardmaru

22 May 2023

LIMA, a 65B LLaMa fine-tuned only with supervised learning on 1000 curated examples, without any RLHF, demonstrates remarkably strong performance and generalizes well to unseen tasks not in the training data. Comparable to GPT-4, Bard, DaVinci003 in human studies. teddit.net/r/MachineLearning…

226

1,464

591,432

hardmaru · May 13, 2019 · 5:36 AM UTC

hardmaru

@hardmaru

13 May 2019

An interactive article explaining why weight initialization is so important for training neural nets by @deeplearningai_, written in the distill.pub format. deeplearning.ai/ai-notes/ini…

459

1,478

hardmaru · Dec 15, 2021 · 6:48 AM UTC

hardmaru

@hardmaru

15 Dec 2021

With the right body, no brain is needed.

MachinePix

228

1,430

hardmaru · Sep 15, 2021 · 6:41 AM UTC

hardmaru

@hardmaru

15 Sep 2021

Uber developed a system of nested hexagons to represent space, called H3: eng.uber.com/h3/

183

1,432

hardmaru · Aug 2, 2017 · 3:48 PM UTC

hardmaru

@hardmaru

2 Aug 2017

how probability distributions are related

461

1,416

hardmaru · Mar 2, 2020 · 12:59 AM UTC

hardmaru

@hardmaru

2 Mar 2020

Jupyter notebooks with Python examples for reproducing examples from each chapter of Christopher Bishop's “Pattern Recognition and Machine Learning” textbook (also available for free in link above) github.com/ctgk/PRML

405

1,425

hardmaru · Dec 20, 2021 · 10:10 PM UTC

hardmaru

@hardmaru

20 Dec 2021

I’m super excited to see ideas from complex systems such as swarm intelligence, self-organization, and emergent behavior gain traction again in AI research. We wrote a survey of recent developments that combine ideas from deep learning and complex systems: arxiv.org/abs/2111.14377

284

1,413

hardmaru · Aug 12, 2018 · 10:10 PM UTC

hardmaru

@hardmaru

12 Aug 2018

Academic Torrents is a distributed system for sharing enormous datasets. So far they have made 27.23TB of research data available. academictorrents.com

517

1,415

hardmaru · Jul 12, 2022 · 1:16 AM UTC

hardmaru

@hardmaru

12 Jul 2022

The map of the brain, created by an aerospace engineer. These are the result of six years of research. It’s always interesting to me to view the perspective of one challenging scientific field through the lens of an expert from another field. 🧠 Source: thehighestofthemountains.com…

302

1,378

hardmaru · Sep 4, 2024 · 6:09 AM UTC

hardmaru

@hardmaru

4 Sep 2024

Excited to announce our Series A! We raised more than $100M to grow Sakana AI into a World Class AI Lab in Japan. We’re going to really push the frontiers of what’s possible with AI. As a founder mode startup, we operate much faster than most frontier AI labs at a global level.

Sakana AI

@SakanaAILabs

4 Sep 2024

Announcing Our Series A sakana.ai/series-a

117

101

1,439

233,500

hardmaru · Feb 21, 2025 · 5:49 PM UTC

hardmaru

@hardmaru

21 Feb 2025

After many years, this guy has come back to haunt me.

hardmaru

@hardmaru

11 Oct 2018

Replying to @hardmaru

If we remove all design constraints, the optimizer came up with a really tall bipedal walker robot that “solves” the task by simply falling over and landing near the exit.

1,443

110,062

hardmaru · Aug 6, 2020 · 5:51 AM UTC

hardmaru

@hardmaru

6 Aug 2020

Self-attention mechanism can be viewed as the update rule of a Hopfield network with continuous states. Deep learning models can take advantage of Hopfield networks as a powerful concept comprising pooling, memory, and attention. arxiv.org/abs/2008.02217 github.com/ml-jku/hopfield-l…

349

1,399
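The claim in the Aug 6, 2020 post above can be made concrete with a small sketch. It assumes the continuous Hopfield update takes the form "new state = softmax-weighted average of the stored patterns, with weights softmax(beta * pattern · state)", which is how the linked paper is usually summarized. The function names and the toy patterns are illustrative only and are not taken from the linked repository.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def hopfield_update(patterns, query, beta):
    """One update step: weight each stored pattern by
    softmax(beta * <pattern, query>) and return the weighted average.
    With the patterns used as both keys and values and beta = 1/sqrt(d),
    this is the softmax attention computation for a single query."""
    scores = [beta * sum(p * q for p, q in zip(pattern, query))
              for pattern in patterns]
    weights = softmax(scores)
    dim = len(query)
    return [sum(w * pattern[d] for w, pattern in zip(weights, patterns))
            for d in range(dim)]


if __name__ == "__main__":
    stored = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    noisy = [0.9, 0.1, 0.0]  # a corrupted version of the first pattern
    print(hopfield_update(stored, noisy, beta=20.0))
```

With a large `beta` the update snaps the noisy query onto the closest stored pattern (memory retrieval); with a small `beta` it returns something closer to an average of the patterns (pooling).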

hardmaru · May 25, 2022 · 5:42 AM UTC

hardmaru

@hardmaru

25 May 2022

i feel seen

143

1,388

hardmaru · Jul 22, 2017 · 1:04 AM UTC

hardmaru

@hardmaru

22 Jul 2017

Decoding the Enigma with RNNs. They trained an LSTM with 3000 hidden units to decode ciphertext with 96%+ accuracy. greydanus.github.io/2017/01/…

735

1,394

hardmaru · Mar 4, 2019 · 8:01 AM UTC

hardmaru

@hardmaru

4 Mar 2019

Using deep learning to implement linear regression

MachinePix

@MachinePix

3 Mar 2019

A skilled excavator operator with an Engcon EC206 tiltrotator.

291

1,350

hardmaru · Jan 18, 2020 · 12:46 PM UTC

hardmaru

@hardmaru

18 Jan 2020

Edo period cat meme

466

1,327

hardmaru · Apr 6, 2020 · 2:09 AM UTC

hardmaru

@hardmaru

6 Apr 2020

Dive into Deep Learning: An interactive deep learning book with code, math, and discussions, based on the NumPy interface. I really like the format of the textbook! d2l.ai/

324

1,368

hardmaru · Oct 29, 2021 · 12:16 AM UTC

hardmaru

@hardmaru

29 Oct 2021

MANGA sounds better than FAANG

401

1,367

hardmaru · Sep 23, 2023 · 8:40 AM UTC

hardmaru

@hardmaru

23 Sep 2023

TinyML and Efficient Deep Learning Computing MIT 6.5940 (efficientml.ai) “This course will introduce efficient AI computing techniques that enable powerful deep learning applications on resource-constrained devices. Topics include model compression, pruning, quantization, neural architecture search, distributed training, data/model parallelism, gradient compression, and on-device fine-tuning. It also introduces application-specific acceleration techniques for large language models, diffusion models, video recognition, and point cloud. This course will also cover topics about quantum machine learning. Students will get hands-on experience deploying large language models (e.g., LLaMA 2) on a laptop.”

215

1,349

239,527

hardmaru · Oct 19, 2020 · 1:16 AM UTC

hardmaru

@hardmaru

19 Oct 2020

Conventional thinking: Build a robot to solve the problem. Out-of-the-box thinking: Get the problem to solve itself. Example: Self-solving Rubik's Cube by @takashikaburagi

278

1,336