The Machine Intelligence Research Institute exists to maximize the probability that the creation of smarter-than-human intelligence has a positive impact.

Berkeley, CA
#7 Combined Print & E-Book Nonfiction #8 Hardcover Nonfiction
17
13
193
33,184
New pod with Nate Soares, @So8res, author of "If Anyone Builds It, Everyone Dies". We discuss: - Risk of superintelligent AI - How AI is grown, not built - Why we lose control - The builders' own odds - Can we still stop? Full episode in thread 👇
7
8
41
13,660
"But the elephant being ignored is Anthropic itself. It, and other frontier labs, are racing towards superhuman AI, dragging Chinese copycats along for the ride. The U.S. should take steps to make it harder for China to access dangerous capabilities, but this is not enough; our national security is at risk as long as those capabilities exist at all. And our survival as a species is threatened as long as AI capabilities continue to escalate beyond our ability to contain or control. One of the main drivers of Chinese AI progress is American AI progress. Perpetually denying dangerous AI capabilities to rivals and bad actors while developing them ourselves isn’t really feasible. To quote my colleague Mitch, we may as well try to outrun our own shadow"
2
5
17
775
“If this is all marketing hype — most of it from companies with trillion-dollar initial public offerings on the line — it’s a heck of a bluff, and we should call them on it."
7
50
3,270
The US Government has requested a slow staggered rollout of GPT-5.6, and OpenAI has agreed. During this phase the government will approve each user individually. This will probably be the norm for all frontier models from all labs from now on.
New w/ @leomschwartz @amir: The Trump admin has asked OpenAI to stagger the release of GPT-5.6 over security concerns. On Thursday, CEO Sam Altman told staff that the government will be approving access to GPT-5.6 customer by customer, a highly unusual approach.
290
336
3,004
1,480,426
Great article about lessons from the Cold War for AI governance. MIRI's TGT has been working on just this sort of verification for some time now. Lots more work is needed.
No one in AI trusts their rivals to slow down — so everyone races ahead. New tech could help break that impasse. I wrote about what the Cold War might be able to teach us about AI governance: time.com/article/2026/06/23/…
1
1
7
565
Promising new idea to verify AI chips aren't used to violate an AI pause treaty: use network taps to watch for fingerprints in the traffic between the chips and their users. But what if data is smuggled out via side-channels? MIRI's Naci Cankaya on how to defend against this:
2
3
36
1,792
The Japanese edition of If Anyone Builds It, Everyone Dies has apparently "already gone into its third printing due to overwhelming demand since its release", within two weeks of its release date!
7
10
150
8,985
"The timeline is not years, it is months." Early this morning, Five Eyes—the secretive alliance of intelligence agencies for the US, UK, Canada, Australia, and New Zealand—issued a rare public warning: AI is accelerating the "speed, scale, and sophistication of cyber threats."
1
3
10
651
AI StopWatch turns every dispatch into a podcast episode. Tune in wherever you get your podcasts. aistop.watch/s/podcast
1
3
776
MIRI retweeted
"Those who want to have a sleepless night only need to read [@ESYudkowsky] and [@So8res]' recent book" Former Health Secretary Baroness Bottomley warns of the threat posed by superintelligent AI. She says we need to ensure tech remains the servant of humanity, not its master.
3
15
65
3,159
MIRI retweeted
"The most likely outcome, I think, is that they kill everybody." "Wow." AI researcher Nate Soares (@So8res) on Fox News, explaining why he expects superintelligent AI to lead to human extinction. Source: @willcain's Will Cain Country
6
20
60
3,688
«Если какая‑либо компания или группа на планете создаст искусственный сверхинтеллект, используя что‑либо, даже отдаленно похожее на современные технологии, и основываясь на понимании ИИ, даже отдаленно похожем на нынешнее, то погибнут все люди на всей Земле.» «Если кто-то его создаст — все погибнут» авторов @ESYudkowsky и @So8res теперь доступна на русском языке
1
7
29
1,137
MIRI retweeted
Replying to @deanwball
1. Our agreement permits inference, broadly, to continue. It only limits AI training above 10^24 FLOP. In Article VIII, we establish that the agreement restricts only research that advances toward ASI or undermines verification. We permit medical diagnostics and drug discovery, for example, among other non-general capabilities. As long as the research focuses on a narrow domain (like materials science or climate modeling) and does not increase "general cognitive capabilities," it is not subject to restrictions. There is no seizing of assets, but rather mandatory monitoring which can involve forced consolidation of compute into monitored datacenters by the signatory government domestically. (No international centralization of AI monitoring!) So, for example, a small university lab’s research compute would be relocated to a monitored datacenter but is still owned and (remotely) operated by that lab. 2. We architected our agreement to specifically not ban the additional production and installation of compute. The value of such for inference and real-world economic activity is only increasing. So the total economic impact on parts of the AI supply chain are probably far less than existential. We note that many of the relevant players have multi-year backlogs to clear at full production capacity, and building out the infrastructure for model development is only a part of that. 3. Nothing happens to consumer AI apps. If anything additional resources are freed up which can accelerate adoption and integration of these tools. 4. Individual governments handle their own verification and monitoring, but some transparency, random inspections, etc. to either China or the US will be required. Our agreement opts for comprehensive domestic monitoring and shares the least information with rivals (directly) which establishes required confidence. There is no multilateral international AI inspection apparatus in our current version of the agreement, which is a change from the first version we published. 5. Robotics and other applications are unaffected to the extent that progress can continue without developing new models with runs that exceed the compute limit. 6. Our agreement does not attempt to set up a “binding” constraint, in that we do not consider the legal, reputational constraints on US/China to be the active ingredient which leads to stability of the treaty. Instead, we establish a clear understanding on both sides of what behavior is restricted and what consequences must follow otherwise, with enough transparency and non-interference with intelligence gathering to create confidence in each side that the other is adhering to the agreement. Rather than legal constraints, I consider deterrence as the framework for thinking about the stability of the agreement. This is the historical method for constraining a major power, including the use of strategically significant assets. Nuclear deterrence prevented Soviet employment of its conventional superiority against Western Europe. A Soviet dead hand blocked what was otherwise the possibility of a perfect first strike (nuclear or otherwise). Stability emerges when both sides coordinate to make the consequences of aggression clear. Is that a “binding constraint” that one side “accepts”? No, not legally, textually. Not in the sense of “I staked my reputation and my honor in the arena of international relations”. Where agreements have a role to play is in helping to maintain the conditions for successful deterrence. An example of this is the anti-ballistic missile treaty, in which both sides limited their deployment of missile defenses so that deterrence could be maintained.
1
4
20
1,622
Godfather of AI (and world's #1 most cited scientist) announces his support for a coordinated global AI pause!
If leading AI companies are indeed approaching the point of recursive self-improvement, a coordinated, verifiable, and universally applied pause is probably the only responsible solution to mitigate several major AI risks; at least until safety guarantees are developed and demonstrated. Ensuring that such a moratorium is respected would require sincere collaboration between various countries and companies, but I definitely believe it is achievable if others follow in @AnthropicAI's footsteps.
14
19
152
14,843
MIRI retweeted
That there are no urgent hearings on Capitol Hill, no serious legislation in the pipeline, and no persistent questioning of candidates for higher office on their proposed approaches to AI is incredible given how transformative the technology is and how fast it is moving.
Our highest and most urgent national priority should be AI safeguards. The risks of AI weapons, pathogens, mass unemployment, surveillance, and even extinction must not continue to be largely ignored.
17
116
398
36,598
MIRI retweeted
now on the eve of RSI it seems everyone is more mutual conditional pause agreement pilled than they used to be and that seems like a good development
157
86
1,802
276,593
'Some people believe that if machines decide to kill us it's the right thing to do because they're smart' AI researcher Nate Soares reveals that some factions in Silicon Valley mistakenly believe that if an AI is exceptionally intelligent it must also be highly moral. @So8res @Freddygray31
8
11
32
6,188