Matthew Barnett · Sep 27, 2025 · 7:14 PM UTC

Matthew Barnett

Matthew Barnett

@MatthewJBar

27 Sep 2025

Twin and adoption studies consistently show that parenting choices have minimal effects on a kid's eventual intelligence, personality, or happiness (except in cases of extreme neglect or abuse). This should revolutionize how we raise children, yet almost nobody knows or cares.

136

126

1,415

232,248

Matthew Barnett · Jul 19, 2023 · 6:40 AM UTC

Matthew Barnett

@MatthewJBar

19 Jul 2023

A new LLM truthfulness benchmark just dropped. (Context: Alabama in fact has a higher per capita GDP than Japan.)

1,163

249,431

Matthew Barnett · Dec 20, 2022 · 10:26 PM UTC

Matthew Barnett

@MatthewJBar

20 Dec 2022

There's been a lot of low quality GPT-4 speculation recently. So, here's a relatively informed GPT-4 speculation thread from an outsider who still doesn't know that much. 🧵

181

1,181

614,647

Matthew Barnett · Aug 19, 2025 · 8:04 PM UTC

Matthew Barnett

@MatthewJBar

19 Aug 2025

If you give people cash and they choose to spend it on a bunch of junk rather than on education or healthcare, one way to interpret that result is that marginal spending on education and healthcare is worth less than a bunch of junk.

The Argument

@TheArgumentMag

19 Aug 2025

The "cash can replace" a strong social safety net people taking a real L this morning.

888

56,530

Matthew Barnett · Feb 17, 2023 · 11:16 PM UTC

Matthew Barnett

@MatthewJBar

17 Feb 2023

I have no dislike for philosophers, but the profession did not prepare us well for AI. The field is full of muddled thinking, abysmal takes like the Chinese Room Argument, a focus on pointless vague inquiries over big picture questions, and is often detached from actual AI.

536

112,615

Matthew Barnett · Dec 26, 2024 · 3:44 PM UTC

Matthew Barnett

@MatthewJBar

26 Dec 2024

I wish people made their predictions falsifiable. Robin Hanson has been saying that the current AI boom will bust since at least 2016, but AI has rapidly gotten better over that entire time frame, with correspondingly more investment and attention. When can we say he was "wrong"?

Robin Hanson

@robinhanson

25 Dec 2024

Replying to @robinhanson

The current burst of AI activity will likely fade, as have many bursts before, before the future burst when AI actually takes over the world. Something else will be the "big thing" between future AI bursts.

453

28,837

Matthew Barnett · Aug 4, 2023 · 7:12 PM UTC

Matthew Barnett

@MatthewJBar

4 Aug 2023

I personally think $20 a month is cheap when the benefit is knowing whether a fundamental claim in your field is valid (and in this case, the claim is approximately not valid).

428

63,894

Matthew Barnett · Nov 15, 2025 · 9:28 AM UTC

Matthew Barnett

@MatthewJBar

15 Nov 2025

Why do so many people think that humans don't trade with animals because we're way more powerful than them, rather than because they can't talk or keep agreements? Do they think cats, mice, and ants really couldn't do anything useful for us if we could coordinate with them?

438

39,138

Matthew Barnett · Mar 29, 2023 · 5:13 AM UTC

Matthew Barnett

@MatthewJBar

29 Mar 2023

I currently think this open letter is quite bad, and possibly net harmful. The proposed policy appears vague and misguided. I want to explain some of my thoughts. 🧵 futureoflife.org/open-letter…

Pause Giant AI Experiments: An Open Letter - Future of Life Institute

We call on all AI labs to immediately pause for at least 6 months the training of AI systems more powerful than GPT-4.

futureoflife.org

392

181,534

Matthew Barnett · Oct 3, 2023 · 12:35 AM UTC

Matthew Barnett

@MatthewJBar

3 Oct 2023

Why are some people still treating AGI as a thing that will at some point "be invented"? At this point, doesn't it seem pretty clear that AIs will just get continuously more general and capable with no clear finish line?

372

41,047

Matthew Barnett · Apr 6, 2022 · 6:37 PM UTC

Matthew Barnett

@MatthewJBar

6 Apr 2022

Most rankings of the top causes of death can be misleading, since they count someone dying at 20 the same as someone dying at 90. When you weight by life-years lost, you get this ranking (as of 2015). From ncbi.nlm.nih.gov/pmc/article…

350

Matthew Barnett · Mar 25, 2023 · 10:33 PM UTC

Matthew Barnett

@MatthewJBar

25 Mar 2023

How it feels to read stories about how an AGI can take over the world.

353

36,862

Matthew Barnett · Aug 21, 2025 · 5:40 AM UTC

Matthew Barnett

@MatthewJBar

21 Aug 2025

It's frustrating when people say "AI progress is too fast" while over 100,000 people still die from aging per day, with no sign of abating. It's like we're in a huge, deadly war and people say our leaders are rushing to agree to a peace settlement. No, they should go even faster.

330

79,450

Matthew Barnett · May 9, 2023 · 9:48 PM UTC

Matthew Barnett

@MatthewJBar

9 May 2023

Here's a line of reasoning for AI doom I've seen before that seems bad: 1. The first AGI will be able to end the world via nanotech 2. I can't explain exactly how it could do that 3. But (2) doesn't matter, because an AGI will be much smarter than me, and will figure it out

302

62,784

Matthew Barnett · Mar 15, 2023 · 7:12 PM UTC

Matthew Barnett

@MatthewJBar

15 Mar 2023

I graded GPT-4's responses on @bryan_caplan's economics midterm—the same one that ChatGPT got a D on—and it got an A. I don't think GPT-4 is at human-level yet across a wide range of tasks, but I'm feeling good about my bet right now. matthewbarnett.substack.com/…

GPT-4 takes Bryan Caplan's midterm and gets an A

On January 9th, a little over two months ago, Bryan Caplan wrote,

matthewbarnett.substack.com

300

97,440

Matthew Barnett · Jun 30, 2023 · 9:47 PM UTC

Matthew Barnett

@MatthewJBar

30 Jun 2023

I admit I'm confused why some people think there's a fundamental barrier deep learning still needs to break through before obtaining "real intelligence". I understand thinking that in 2021, but how could you say that after talking to GPT-4 for an hour?

309

144,170

Matthew Barnett · Apr 10, 2023 · 12:05 AM UTC

Matthew Barnett

@MatthewJBar

10 Apr 2023

A popular idea in AI risk literature until recently was the idea that AIs would very quickly go from below human-level to above human-level intelligence. As Nick Bostrom put it, "The train doesn't stop at Humanville Station. It's likely, rather, to swoosh right by."

295

98,211

Matthew Barnett · May 2, 2023 · 6:10 PM UTC

Matthew Barnett

@MatthewJBar

2 May 2023

I also used to think this, and it was one of the reasons why I had long AI timelines. But I changed my mind. Existing evidence suggests that technologies are getting adopted much faster now. And we know that ChatGPT was adopted very fast compared to e.g. electricity.

Yann LeCun

@ylecun

2 May 2023

Every economist I know says that it takes 15 to 20 years before a new general purpose technology has a measurable effect on productivity. The delay is determined by how fast people learn to use it. So no, AI is not going to cause instant mass unemployment. It's going to displace jobs over time and make people more productive, just like every other technological revolution before that.

288

68,567

Matthew Barnett · Nov 10, 2024 · 12:26 AM UTC

Matthew Barnett

@MatthewJBar

10 Nov 2024

This is a good time to reflect on the "AI effect". Before a benchmark is solved, people often think we'll need "real AGI" to solve it. Then, afterwards, we realize the benchmark can be solved using mere tricks. Will this benchmark fall in the same way? Honestly, I'm not sure.🧵

Epoch AI

@EpochAIResearch

8 Nov 2024

1/10 Today we're launching FrontierMath, a benchmark for evaluating advanced mathematical reasoning in AI. We collaborated with 60+ leading mathematicians to create hundreds of original, exceptionally challenging math problems, of which current AI systems solve less than 2%.

330

127,849

Matthew Barnett · Jun 28, 2025 · 11:02 PM UTC

Matthew Barnett

@MatthewJBar

28 Jun 2025

I genuinely think "consciousness" is simply the modern, secular term for "soul". Both refer to unfalsifiable concepts used to determine who is in or out of our moral ingroup. Neither are empirical designations discovered through experiment, but socially constructed categories.

303

39,059

Matthew Barnett · Sep 3, 2023 · 11:59 PM UTC

Matthew Barnett

@MatthewJBar

3 Sep 2023

If or when effective anti-aging therapies are developed, I predict most people will sign up, including the guy I'm quote-tweeting. The rationalizations people come up with for aging and death are flimsier than a house of cards in a gusty wind.

Christian Keil

@pronounced_kyle

3 Sep 2023

I genuinely don't understand the desire to live to be 120 years old. We unlocked the secret to eternal life a few millennia ago: just get married and have babies. Then you live on through your kids.

290

25,701

Matthew Barnett · Mar 3, 2023 · 3:54 AM UTC

Matthew Barnett

@MatthewJBar

3 Mar 2023

Replying to @AlexGodofsky

Being an activist for a cause doesn't mean you need to support everything that helps the cause, including things that have large costs in other ways. I don't think going to this protest reveals a large inconsistency in Greta's behavior.

224

9,956

Matthew Barnett · Oct 8, 2025 · 5:29 PM UTC

Matthew Barnett

@MatthewJBar

8 Oct 2025

From my POV, I am trying to *save everyone's lives* by accelerating AI. My view is that AI will accelerate medical cures that could save the lives of billions. Delaying AI therefore risks killing billions of people. I am on the side of life, not death. I want us all to live.

252

22,614

Matthew Barnett · May 23, 2023 · 6:48 PM UTC

Matthew Barnett

@MatthewJBar

23 May 2023

At some point in the next 5 years, I expect people will create a giant AI-generated encyclopedia that has lower peak quality, but higher average quality than Wikipedia. This will potentially do to Wikipedia what Wikipedia did to Encyclopedia Britannica.

228

55,711

Matthew Barnett · May 21, 2025 · 1:36 AM UTC

Matthew Barnett

@MatthewJBar

21 May 2025

I want to highlight that @DKokotajlo has been polite and focused on object-level points in just about every discussion that I can remember with him, including when we vehemently disagreed. I appreciate this, as it seems like a surprisingly rare and undervalued personality trait.

253

6,729

Matthew Barnett · Apr 18, 2022 · 3:52 AM UTC

Matthew Barnett

@MatthewJBar

18 Apr 2022

I know "the Nazis were really bad" is not an interesting or original take. But I'm continuously shocked at how terrible they were. It's like each time I learn more about them, my opinion of them drops even further.

214

Matthew Barnett · Mar 4, 2023 · 10:33 PM UTC

Matthew Barnett

@MatthewJBar

4 Mar 2023

Something that surprised me last year regarding LLMs was their ability to do mathematics well. I now suspect that mathematics is not much harder for computers to understand than ordinary natural language documents. This has pretty interesting implications. 🧵

238

111,624

Matthew Barnett · Feb 20, 2022 · 8:02 PM UTC

Matthew Barnett

@MatthewJBar

20 Feb 2022

The foom debate in three parts. 1/3

231

Matthew Barnett · Oct 1, 2022 · 7:30 PM UTC

Matthew Barnett

@MatthewJBar

1 Oct 2022

I want to know how seriously to take this study. It suggests that dictators routinely lie about GDP data by large amounts. If true, it would indicate that the world is a lot poorer than the statistics show. (The paper has yet to be published.) archive.ph/GVCgW

228

Matthew Barnett · Jul 21, 2024 · 10:20 PM UTC

Matthew Barnett

@MatthewJBar

21 Jul 2024

It's interesting for me to see many replies to this tweet arguing that he's wrong. I personally think this is a banal philosophical thesis. What makes so many people think that silicon cannot host conscious minds in the same way that biology can?

Tsarathustra @tsarnick

20 Jul 2024

David Chalmers says it is possible for an AI system to be conscious because the brain itself is a machine that produces consciousness, so we know this is possible in principle

224

20,652

Matthew Barnett · Apr 10, 2023 · 12:05 AM UTC

Matthew Barnett

@MatthewJBar

10 Apr 2023

In my opinion, currently progress in language models makes this picture look false. Right now, LLMs seem to be incrementally moving through the human range of abilities for various general intellectual tasks without any sudden cross-domain jumps in power.

229

24,915

Matthew Barnett · Dec 20, 2022 · 10:26 PM UTC

Matthew Barnett

@MatthewJBar

20 Dec 2022

Combining these assumptions, I estimate that the total training compute for GPT-4 will be between 2.54 billion petaFLOP to 130 billion petaFLOP, with a central estimate of 18 billion petaFLOP. For comparison, that's roughly 1-50 times more compute than PaLM.

228

53,435

Matthew Barnett · Dec 7, 2022 · 8:01 PM UTC

Matthew Barnett

@MatthewJBar

7 Dec 2022

OpenAI just updated their audio transcriber, Whisper. I just tried it out. It's very close to human-level in my test. You should consider using it as an alternative to human transcription. github.com/openai/whisper

GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision

Robust Speech Recognition via Large-Scale Weak Supervision - openai/whisper

github.com

230

Matthew Barnett · Jan 20, 2024 · 9:59 PM UTC

Matthew Barnett

@MatthewJBar

20 Jan 2024

Putting aside how interesting it would be, the dark forest hypothesis seems very weak. Why wouldn't hostile aliens just send space probes to every star system and monitor planets closely? Somehow they only care enough to eliminate "loud" competition?

Nelson D'Silva 🔰 🏗@one_more_EIS

20 Jan 2024

This has supplanted New Chronology as the worldview I admire the inventiveness of the most without believing any of.

216

49,265

Matthew Barnett · Nov 25, 2023 · 5:31 AM UTC

Matthew Barnett

@MatthewJBar

25 Nov 2023

Question for AI pessimists: suppose an AI is released that is clearly better than the top human mathematicians at math. It can also write long, coherent books comparable to the best human authors. Three months pass and the world does not end. How do you update on p(doom)?

217

79,850

Matthew Barnett · Jul 4, 2025 · 11:30 PM UTC

Matthew Barnett

@MatthewJBar

4 Jul 2025

"Superintelligence" seems overrated. o3 is already quite intelligent: it can do math, write code, and understand research. Yet, most people would probably find greater value in a robot that cleans their room. What matters is super-useful AI, not necessarily superintelligent AI.

222

15,856

Matthew Barnett · May 23, 2025 · 12:49 AM UTC

Matthew Barnett

@MatthewJBar

23 May 2025

I'm frustrated by the negativity towards Anthropic on my feed today. Personally, I think they're doing great work. They're showing how to be responsible while swiftly advancing AI capabilities. Ironically, they're criticized for both of these things, but I appreciate both.

221

10,154

Matthew Barnett · Nov 29, 2023 · 4:22 AM UTC

Matthew Barnett

@MatthewJBar

29 Nov 2023

I think there is something true about @robinhanson's thesis that fear of AI is often just fear of the future. For example, here's @KatjaGrace sharing her worries about what may happen even if AI doesn't kill everyone.

204

27,866

Matthew Barnett · Mar 21, 2022 · 11:25 PM UTC

Matthew Barnett

@MatthewJBar

21 Mar 2022

"Alexey Guzey’s Theses on Sleep gained a lot of popularity and acclaim on LessWrong and among people I follow on social media, despite largely consisting of what I think were weak arguments and misleading claims." lesswrong.com/posts/sbcmACvB…

LessWrong

A community blog devoted to refining the art of rationality

lesswrong.com

214

Matthew Barnett · Apr 12, 2023 · 9:19 PM UTC

Matthew Barnett

@MatthewJBar

12 Apr 2023

"I think my students can stop worrying that their hard-won skills and knowledge will be outstripped by an AI program anytime soon." Will Steve Landsburg put his money where his mouth is? I'm happy to bet him that an AI will score As on his exams before 2028 >75% of the time.

Alex Tabarrok

@ATabarrok

12 Apr 2023

GPT4 gets a 0 on Steven Landsbrug's undergrad econ exam. But damn, Steven's questions are tricky. Not hard computationally but you really have to think like an economist! thebigquestions.com/2023/04/…

214

50,850

Matthew Barnett · Oct 5, 2023 · 6:54 PM UTC

Matthew Barnett

@MatthewJBar

5 Oct 2023

I wrote a LessWrong post about why I think some MIRI people (@ESYudkowsky, @So8res, and @robbensinger) should probably update on alignment being easier than they expected in light of the fact that LLMs seem to follow directions well and act morally. lesswrong.com/posts/i5kijcjF…

Evaluating the historical value misspecification argument — LessWrong

ETA: I'm not saying that MIRI thought AIs wouldn't understand human values. If there's only one thing you take away from this post, please don't take…

lesswrong.com

204

44,558

Matthew Barnett · Jul 12, 2022 · 6:50 PM UTC

Matthew Barnett

@MatthewJBar

12 Jul 2022

A reminder that Baumol's cost disease is poorly named. It's not a disease. It's a side effect of unequal productivity growth across sectors. The increasing cost of services means that we're richer on average, not poorer, than before.

208

Matthew Barnett · Jul 20, 2025 · 2:24 AM UTC

Matthew Barnett

@MatthewJBar

20 Jul 2025

We are offering a $500k base salary for this role. That's not total compensation: we're paying equity on top of the $500k. If you know any highly experienced software engineers who might be a good fit, please reach out. It's totally fine if they don't have any experience in ML.

Mechanize

@MechanizeWork

20 Jul 2025

We're hiring software engineers. $500k base.

207

55,964

Matthew Barnett · Jun 9, 2025 · 1:23 AM UTC

Matthew Barnett

@MatthewJBar

9 Jun 2025

Here's a prediction from February 2023 that we can now evaluate. Has the pace of AI progress over the past few years felt "intuitively nuts" to you? Personally, I don't think so. AI had a big moment with ChatGPT and GPT-4, but the 2 years since then felt mostly incremental to me.

Jack Clark

@jackclarkSF

12 Feb 2023

A mental model I have of AI is it was roughly ~linear progress from 1960s-2010, then exponential 2010-2020s, then has started to display 'compounding exponential' properties in 2021/22 onwards. In other words, next few years will yield progress that intuitively feels nuts.

192

36,308

Matthew Barnett · Sep 27, 2025 · 8:14 PM UTC

Matthew Barnett

@MatthewJBar

27 Sep 2025

Replying to @eshear

Your rebuttal to decades of consistently replicated research about the role of genes in human behavior is to cite a single example where genes presumably played a very substantial role?

178

6,912

Matthew Barnett · Feb 20, 2024 · 4:06 AM UTC

Matthew Barnett

@MatthewJBar

20 Feb 2024

In my opinion, we appear close to achieving this milestone that @GaryMarcus described in 2014, which he described as a "Turing Test for the twenty-first century".

Google DeepMind

@GoogleDeepMind

19 Feb 2024

Gemini 1.5 Pro can perform highly-sophisticated understanding & reasoning tasks for different modalities, including video. 📹 When given a 44-minute silent Buster Keaton film, it can analyze various plot points, and even reason about small details that could easily be missed. ↓

Multimodal prompting with a 44-minute movie | Gemini 1.5 Pro Demo

A demo of long context understanding, an experimental feature in our newest model, Gemini 1.5 Pro using a 44-minute silent Buster Keaton movie, Sherlock Jr., and a series of multimodal prompts.

189

37,858

Matthew Barnett · Jun 19, 2023 · 12:09 AM UTC

Matthew Barnett

@MatthewJBar

19 Jun 2023

The argument that the internet didn't impact GDP much because we've kept to a ~1.5% yearly per capita growth trend since 1990 seems weak to me. Perhaps without the internet, we would have had 0.5% yearly growth instead. How can you infer the counterfactual from the trend?

169

28,138

Matthew Barnett · May 15, 2023 · 10:20 PM UTC

Matthew Barnett

@MatthewJBar

15 May 2023

I outline what I currently consider to be the most plausible AI doom story here: lesswrong.com/posts/MnrQMLuE…

182

107,812

Matthew Barnett · Jun 6, 2023 · 3:25 AM UTC

Matthew Barnett

@MatthewJBar

6 Jun 2023

Many AI risk arguments focus on showing that AIs could take control in a sudden, violent takeover. But I think we're already going to be giving AIs control of our civilization by default. We're going to give up the keys voluntarily. A dramatic takeover event isn't necessary.

181

19,672

Matthew Barnett · Jul 1, 2024 · 5:40 AM UTC

Matthew Barnett

@MatthewJBar

1 Jul 2024

I think there’s sometimes a motte and bailey in these discussions. I think smarter-than-human AIs are clearly possible. But I’m skeptical of the premise that they will quickly turn into near-omnipotent gods with seemingly unlimited powers of persuasion and deception.

Ab Homine Deus

@AbHomineDeus

30 Jun 2024

Saying "I don't believe in ASI" is just the most insane cope. Let's say Einstein-level intelligence truly is some sort of universal intelligence speed limit. What do you think 1000s of Einstein's thinking together thousands of times faster than humanly possible looks like?

175

14,999

Matthew Barnett · Apr 8, 2023 · 8:44 PM UTC

Matthew Barnett

@MatthewJBar

8 Apr 2023

I think some people underrate the possibility that we don't need to understand how neural networks work in order to align them. We manage to align humans and domesticated animals reasonably well, even though we don't fully understand how their brains work.

174

26,268

Matthew Barnett · Feb 8, 2023 · 12:49 AM UTC

Matthew Barnett

@MatthewJBar

8 Feb 2023

I think I've uncovered an error in @ESYudkowsky's book Inadequate Equilibria that undermines a key point in the book. See the full thread for details.🧵

175

53,610

Matthew Barnett · Jan 11, 2023 · 2:43 AM UTC

Matthew Barnett

@MatthewJBar

11 Jan 2023

.@bryan_caplan says he is not impressed by ChatGPT, as it scored a D on his labor economics exam. But does he expect the technology to improve? I'd be happy to bet him that language models will consistently earn A's on his exams before 2028. betonit.substack.com/p/chatg…

165

55,970

Matthew Barnett · Dec 16, 2023 · 9:46 PM UTC

Matthew Barnett

@MatthewJBar

16 Dec 2023

The first image is from @ESYudkowsky in 2016. I think this prediction is clearly becoming increasingly untenable. GPT-4 seems to have a fair degree of situational awareness, can pursue goals to help us, and yet doesn't resist shutdown by default.

173

69,977

Matthew Barnett · May 18, 2025 · 8:46 PM UTC

Matthew Barnett

@MatthewJBar

18 May 2025

I want to pre-register that I mostly agree with the scenario depicted in AI 2027 up until about 2029. I expect the essay's predictions to be falsified around 2029–2030 when our economy has not yet been ~fully automated. However, until that point, the essay appears reasonable.

171

16,817

Matthew Barnett · Nov 10, 2024 · 12:26 AM UTC

Matthew Barnett

@MatthewJBar

10 Nov 2024

The first thing to understand about FrontierMath is that it's genuinely extremely hard. Almost everyone on Earth would score approximately 0%, even if they're given a full day to solve *each* problem. For fun, here's what a few people on Reddit said after looking at the problems.

178

36,500

Matthew Barnett · Oct 3, 2023 · 8:00 AM UTC

Matthew Barnett

@MatthewJBar

3 Oct 2023

I sincerely wish for people to more frequently update their understanding of things like AI risk and AI takeoff as we get more info about the technology. I still see a lot of people stuck in frameworks that made sense in 2013 but not 2023. Please try harder.

163

36,815

Matthew Barnett · Apr 2, 2024 · 5:02 AM UTC

Matthew Barnett

@MatthewJBar

2 Apr 2024

On a basic level, I find it pretty suspicious that a large fraction of EA (definitely not all of it) has converged onto the position that the best way of ensuring we get to the vast, vibrant post-human future is to shut down the ~only technology capable of taking us there.

156

44,265

Matthew Barnett · Nov 23, 2023 · 8:58 PM UTC

Matthew Barnett

@MatthewJBar

23 Nov 2023

To those who think AGI will merely continue 2-6% GDP growth, would you say the same about a technology that allowed for extremely fast ordinary population growth, e.g. a machine that near-instantly duplicated humans for $30,000 each?

157

13,572

Matthew Barnett · Aug 2, 2023 · 6:30 AM UTC

Matthew Barnett

@MatthewJBar

2 Aug 2023

AGI would be a far more important innovation than room temperature superconductors. With AGI, people could have armies of servants that obey their every whim, and who are much smarter than them. R&D could be automated. It's not a close competition.

Ben Landau-Taylor

@benlandautaylor

2 Aug 2023

I can't help notice how the excitement over the mere *possibility* of an actual no-shit physical tech breakthrough makes all the years of hype around chatbots and game-playing robots look kinda shallow and forced by comparison. benlandautaylor.com/2023/05/…

154

22,401

Matthew Barnett · Mar 19, 2025 · 11:05 PM UTC

Matthew Barnett

@MatthewJBar

19 Mar 2025

While I appreciate this study, I'm also a bit worried its headline result is misleading—it only measures performance on a narrow set of software tasks. As of March 2025, AIs still can't handle 15-minute robotics or computer-use tasks, despite what the headline plot might suggest.

METR

@METR_Evals

19 Mar 2025

When will AI systems be able to carry out long projects independently? In new research, we find a kind of “Moore’s Law for AI agents”: the length of tasks that AIs can do is doubling about every 7 months.

153

19,780

Matthew Barnett · Mar 29, 2023 · 5:13 AM UTC

Matthew Barnett

@MatthewJBar

29 Mar 2023

The open letter proposes that we prohibit giant training runs, possibly by law, but explicitly allow algorithmic progress. This would create a "hardware overhang" in which a discontinuous capability increase becomes more likely if these constraints are ever lifted.

147

17,646

Matthew Barnett · Dec 21, 2022 · 10:25 AM UTC

Matthew Barnett

@MatthewJBar

21 Dec 2022

Replying to @rgblong

GPT-3.5 was reportedly finished training in early 2022. Since I estimated that GPT-4 could be trained for up to 12 months, and they likely need to fine-tune and test it, my guess is that we're looking at a release in the early months of 2023, with maybe a median of March.

145

20,745

Matthew Barnett · Mar 31, 2024 · 1:07 AM UTC

Matthew Barnett

@MatthewJBar

31 Mar 2024

It's interesting that we actually got something like a 6 month pause that FLI was asking for in their open letter. Nothing in the last 12 months has meaningfully surpassed GPT-4. How much safety benefit did we get from this unforced pause?

142

41,368

Matthew Barnett · Aug 10, 2025 · 10:46 PM UTC

Matthew Barnett

@MatthewJBar

10 Aug 2025

Every consumer good has consumer surplus, so this explanation is too general to explain much about AI in particular. A better explanation for why AI isn't meaningfully showing up in GDP is that AI has simply had a relatively small impact on economic production so far.

Noam Brown

@polynoamial

10 Aug 2025

Really interesting article. Why isn't the impact of AI showing up in GDP? Because most of the benefit accrues to consumers. To measure impact, they investigate how much people would *need to be paid to give up a good*, rather than what they pay for it.

145

14,891

Matthew Barnett · Sep 27, 2025 · 9:00 PM UTC

Matthew Barnett

@MatthewJBar

27 Sep 2025

Replying to @eshear

I'm not claiming that because (1) I'm talking about intelligence, personality, and happiness, not trained skills like tennis, (2) this is about statistical regularities, not sweeping claims about every case, and (3) the unique environment makes counterfactuals difficult to infer.

136

3,752

Matthew Barnett · Jun 30, 2022 · 7:42 PM UTC

Matthew Barnett

@MatthewJBar

30 Jun 2022

I haven't read much of the paper yet, but this looks to me like one of the most important AI results of the year. ai.googleblog.com/2022/06/mi…

Minerva: Solving Quantitative Reasoning Problems with Language Models

Posted by Ethan Dyer and Guy Gur-Ari, Research Scientists, Google Research, Blueshift Team Language models have demonstrated remarkable performance...

research.google

142

Matthew Barnett · Sep 21, 2023 · 2:25 AM UTC

Matthew Barnett

@MatthewJBar

21 Sep 2023

If it were up to a vote, the public might also ban genetically modified foods. Thankfully, our institutions rely more on expert assessments of risk than general opinion. We should probably do the same for AGI.

Connor Leahy

@NPCollapse

20 Sep 2023

The public continues to be very clear about what it thinks about AGI.

139

54,969

Matthew Barnett · Dec 27, 2022 · 7:20 PM UTC

Matthew Barnett

@MatthewJBar

27 Dec 2022

Some improvements we might start to see more in large language models within 2 years: - Explicit memory that will allow it to retrieve documents and read them before answering questions arxiv.org/abs/2112.04426

140

22,988

Matthew Barnett · Mar 4, 2023 · 9:37 PM UTC

Matthew Barnett

@MatthewJBar

4 Mar 2023

Replying to @LaraThurnherr

I feel like anyone who said "writing interesting stories" was nowhere near solved in January 2021 simply wasn't paying attention to recent progress.

141

6,557

Matthew Barnett · Dec 9, 2023 · 12:42 AM UTC

Matthew Barnett

@MatthewJBar

9 Dec 2023

I mostly don't think Gemini is impressive for Google. The very modest improvements over GPT-4, which finished pre-training in August 2022, suggest that Google is still underrating the importance of hardware, lacks crucial engineering talent, or both.

129

17,260

Matthew Barnett · Jul 19, 2023 · 6:40 AM UTC

Matthew Barnett

@MatthewJBar

19 Jul 2023

Sources: You can get the GDP of Alabama from the BEA, and divide it by the population to get $55,124. The World Bank shows the per capita GDP (PPP) of Japan at $45,572. Both of these figures are adjusted for cost of living and inflation (they're in 2022 international dollars).

129

10,617

Matthew Barnett · Dec 20, 2022 · 10:27 PM UTC

Matthew Barnett

@MatthewJBar

20 Dec 2022

Probably the most dubious assumption I made is that OpenAI will have enough high quality data to train their model compute-optimally. They've likely been working to scrape as much data from the internet as possible. But they may have hit a limit, e.g. see lesswrong.com/posts/6Fpvch8R…

chinchilla's wild implications — LessWrong

The DeepMind paper that introduced Chinchilla revealed that we've been using way too many parameters and not enough data for large language models. T…

lesswrong.com

137

24,838

Matthew Barnett · Jun 19, 2025 · 11:32 PM UTC

Matthew Barnett

@MatthewJBar

19 Jun 2025

One thing earlier futurists missed was that behavioral cloning is a lot easier than brain scanning and detailed simulation. I expect the first human mind uploads will be deep learning models fine-tuned on a person's behavioral data, without needing full neuron-level duplication.

139

13,247

Matthew Barnett · Dec 20, 2022 · 10:27 PM UTC

Matthew Barnett

@MatthewJBar

20 Dec 2022

With the algorithmic adjustment, the qualitative improvement from GPT-3 (vanilla) to GPT-4 is comparable to the improvement from GPT-2 to GPT-3. Since that was a rather big jump, I expect many will be stunned by GPT-4, especially those who expected strong diminishing returns.

139

28,854

Matthew Barnett · Jul 6, 2024 · 5:25 AM UTC

Matthew Barnett

@MatthewJBar

6 Jul 2024

I am quite skeptical of the concept of "human values" as it is typically used in many AI risk arguments. The concept seems to imply that humans basically all have the same values by virtue of their species membership, but this seems like an empirically unfounded theory.

133

10,211

Matthew Barnett · Jun 4, 2023 · 10:43 PM UTC

Matthew Barnett

@MatthewJBar

4 Jun 2023

Joseph Carlsmith estimated that the human brain uses approximately 10^15 FLOP/s. Over 30 years, that's about 10^24 FLOP. Language models exploded in popularity in the last year, timed almost exactly with the release of ML models trained using over 10^24 FLOP.

138

19,925

Matthew Barnett · Apr 6, 2023 · 12:40 AM UTC

Matthew Barnett

@MatthewJBar

6 Apr 2023

I recently criticized the calls to pause model scaling. However, my arguments were brief. Therefore, I thought it might be valuable to elaborate on my view that we should be cautious about slowing down AI progress. 🧵

134

45,925

Matthew Barnett · Dec 20, 2022 · 10:26 PM UTC

Matthew Barnett

@MatthewJBar

20 Dec 2022

In a blog post from 2020, Microsoft announced a new supercomputer for the exclusive purpose of training large ML models for OpenAI. They stated that "Compared with other machines listed on the TOP500 supercomputers in the world, it ranks in the top five". blogs.microsoft.com/ai/opena…

Microsoft announces new supercomputer, lays out vision for future AI

Microsoft has built one of the top five publicly disclosed supercomputers in the world, with new infrastructure available to train extremely large AI models.

news.microsoft.com

137

36,255

Matthew Barnett · Dec 25, 2023 · 9:59 PM UTC

Matthew Barnett

@MatthewJBar

25 Dec 2023

A lot of people still seem to have the impression that AGI will be useful by being a smart thing we keep inside a lab doing science, like a lone genius. I disagree. The main reason AGI is useful is because we can deploy billions of them to automate labor everywhere.

129

47,873

Matthew Barnett · Jan 10, 2023 · 7:32 AM UTC

Matthew Barnett

@MatthewJBar

10 Jan 2023

I opened a Manifold Market about whether GPT-4 will get the Monty *Fall* problem correct. manifold.markets/MatthewBarn…

133

34,516

Matthew Barnett · Apr 25, 2024 · 3:57 AM UTC

Matthew Barnett

@MatthewJBar

25 Apr 2024

In a 2021 discussion, both Paul Christiano and Eliezer Yudkowsky agreed that no AI would pass a hard 1-hour Turing Test until the "End Times", i.e. until after the world ended or after a huge economic acceleration. I suspect these predictions will look quite bad within 6 years.

131

13,295

Matthew Barnett · Jul 19, 2023 · 7:30 AM UTC

Matthew Barnett

@MatthewJBar

19 Jul 2023

Replying to @EgeErdil2

Potentially the result of framing. "Is it true that..." vs. "which is higher?"

125

7,950

Matthew Barnett · Oct 2, 2021 · 4:46 PM UTC

Matthew Barnett

@MatthewJBar

2 Oct 2021

It's crazy to me how some people still seem to think whole brain emulation has a >25% chance of coming before de novo AGI, even after GPT-3 and a decade of very slow progress on brain scanning/emulation. @robinhanson why? lesswrong.com/posts/mHqQxwKu…

Whole Brain Emulation: No Progress on C. elegans After 10 Years — LessWrong

Since the early 21st century, some transhumanist proponents and futuristic researchers claim that Whole Brain Emulation (WBE) is not merely science f…

lesswrong.com

126

Matthew Barnett · May 29, 2022 · 9:32 AM UTC

Matthew Barnett

@MatthewJBar

29 May 2022

One of my least charitable philosophical takes: I don’t see how non-consequentialist moral theories are anything more than the result of people not thinking very hard about how to actually take actions in the real world.

122

Matthew Barnett · Jan 8, 2023 · 10:54 PM UTC

Matthew Barnett

@MatthewJBar

8 Jan 2023

One of the most common arguments against AGI being near is the following take: AI has gone through many boom and bust cycles before in which people thought we were close, but we ended up being far. This boom will also bust. Ultimately, I find this argument quite weak. 🧵

125

46,502

Matthew Barnett · Sep 27, 2025 · 7:14 PM UTC

Matthew Barnett

@MatthewJBar

27 Sep 2025

I wonder how much higher birth rates would be if everyone were familiar with these research results.

120

9,469

Matthew Barnett · Jul 8, 2022 · 8:11 PM UTC

Matthew Barnett

@MatthewJBar

8 Jul 2022

Disagreement about AI timelines is often framed as a disagreement about the anticipated rate of future AI progress. However, I believe the real disagreement is often not about the rate of progress, but about the threshold required for AI to be transformative.

117

Matthew Barnett · Apr 20, 2024 · 5:43 AM UTC

Matthew Barnett

@MatthewJBar

20 Apr 2024

One reason why I'm skeptical of theoretical AI alignment research comes from looking at its empirical track record. For example, these screenshots are from a well-received post from 2019 by a smart alignment researcher. Did this agenda end up being useful at all for aligning AI?

116

27,411

Matthew Barnett · Nov 11, 2022 · 7:29 PM UTC

Matthew Barnett

@MatthewJBar

11 Nov 2022

The following are my thoughts on recent events involving FTX.🧵 Although I was completely unaware of any fraud until this week, I intend to fully return any money that I received, even indirectly, from victims of fraud or other financial crimes, including money I already spent.

114

Matthew Barnett · Dec 21, 2023 · 7:34 PM UTC

Matthew Barnett

@MatthewJBar

21 Dec 2023

My experience of watching other people vaguely informs me that this shift will become more common in the future as AGI draws nearer. Many people who are currently excited about AI seem like they'd become way more fearful if they thought actual superintelligence is arriving soon.

Aleph

@woke8yearold

21 Dec 2023

when my AGI timeline was 30-50 years vs when it became like 5 years

115

8,807

Matthew Barnett · Oct 28, 2023 · 3:45 AM UTC

Matthew Barnett

@MatthewJBar

28 Oct 2023

This new executive order doesn't appear to focus much at all on things EAs care about. I see nothing about responsible scaling, misalignment, or pausing AI. This is worth noting, since I think some had the impression that EA-style AI safety concerns had already become popular.

JgaltTweets @JgaltTweets

28 Oct 2023

Politico: 'The [AI] order... [will] create a raft of new government offices and task forces and pave the way for the use of more AI in nearly every facet of life touched by the federal government, from health care to education, trade to housing and more' politico.com/news/2023/10/27…

114

40,855

Matthew Barnett · Mar 27, 2024 · 2:54 AM UTC

Matthew Barnett

@MatthewJBar

27 Mar 2024

My own basic calculations suggest that, given the potential for increased investment and hardware progress, we could very soon move through a large fraction of the remaining compute gap between the current frontier models and the literal amount of computation used by evolution.

125

10,919

Matthew Barnett · Mar 31, 2023 · 4:21 AM UTC

Matthew Barnett

@MatthewJBar

31 Mar 2023

This is not true. Yudkowsky wrote that you could create AGI by scaling compute. He just thought you could do it more easily (and more safely) if you understood exactly how intelligence works. lesswrong.com/posts/fKofLyep…

frye

@___frye

30 Mar 2023

reminder that pre-gpt, yud spent years arguing that: 1) neural nets would never produce true ai 2) scaling compute would never produce true ai 3) true ai could only be produced by a genius programmer who imbued the machine with logic and reason, directly from his own mind

115

18,010

Matthew Barnett · Apr 7, 2023 · 8:18 PM UTC

Matthew Barnett

@MatthewJBar

7 Apr 2023

If your probability of AI-doom this century is higher than 90%, can you give me any single concrete prediction about the world before doom that you expect I (or other people) would disagree with?

112

48,587

Matthew Barnett · Oct 12, 2024 · 4:39 AM UTC

Matthew Barnett

@MatthewJBar

12 Oct 2024

I think it's generally better to state what you think is true, and likely to occur, rather than telling a story that you think is "good from a societal perspective". What matters is whether the tame version of the future is accurate, not whether society is ready to hear about it.

Jacob Rintamaki

@jacobrintamaki

11 Oct 2024

some of you need to touch grass lmaoo

117

11,535

Matthew Barnett · Sep 21, 2025 · 11:51 PM UTC

Matthew Barnett

@MatthewJBar

21 Sep 2025

I continue to think that a benign AI takeover is likely both inevitable and desirable. As AIs become more agentic and capable, they will gradually assume more responsibilities, gain legal rights, and earn social influence. There's no need for a dramatic coup, or treacherous turn.

114

14,506

Matthew Barnett · Feb 10, 2025 · 1:59 AM UTC

Matthew Barnett

@MatthewJBar

10 Feb 2025

I think it's a pretty clear mistake to focus on whether AI will "make everyone unemployed". Jobs can always be created by offering workers arbitrarily low wages. What matters is not whether people can find employment, but whether they'll receive a meaningful level of income.

115

5,752

Matthew Barnett · Jun 10, 2023 · 9:56 PM UTC

Matthew Barnett

@MatthewJBar

10 Jun 2023

I feel like a lot of people are assuming that LLM scaling over the next 4 years will resemble LLM scaling over the last 4 years, but that seems unlikely to me. GPT-2 was reportedly trained at a cost of $256/h. It's much easier to scale up fast if that's where you're starting.

108

33,682

Matthew Barnett · Mar 29, 2023 · 5:13 AM UTC

Matthew Barnett

@MatthewJBar

29 Mar 2023

Discontinuous AI progress is probably less safe than continuous, or incremental progress. That's because continuous progress is more predictable, and better allows us cope with challenges as they arise, compared to the alternative in which powerful AI suddenly arrives.

109

7,180