Known as Mad Max for my unorthodox ideas and passion for adventure, my scientific interests range from artificial intelligence to the ultimate nature of reality

MIT
Here's why I think there's now a one-in-six chance of an imminent global #NuclearWar, and why I appreciate @elonmusk and others urging de-escalation, which is IMHO in the national security interest of all nations: lesswrong.com/posts/Dod9AWz8…
3,307
4,386
31,329
Our paper making neural networks more modular, sparse and interpretable by penalizing neuron length is now on arXiv. Less is more! @ZimingLiu11 arxiv.org/abs/2305.08746
63
384
2,606
567,292
Most of my MIT colleagues are in denial of this coming tsunami
Anthropic: Intellectual AI capabilities matching or exceeding that of Nobel Prize winners across most disciplines—including biology, computer science, mathematics, and engineering by 2026-2027 "Our own recent work adds further evidence to the idea that powerful AI will arrive soon" The big AI companies never tire of emphasizing how close superintelligence is. Yesterday, OpenAI wrote a security post in which they assume that AGI is coming very soon. And today, Anthropic has followed suit, saying that superintelligence with incredible ability will come as soon as next year! 99% of people still don't understand what this means. Imagine having a scientist with the intelligence and creativity of an Albert Einstein or a Max Planck. And this AI entity can be easily prevented. You create a hundred of these entities and let them solve a problem. It will develop completely new approaches, never before imagined theories will arise. Literally an intelligence explosion will happen. We have to finally understand this. It's already happening next year. This year we will see rapid improvements and by the end of the year outstanding agents. But next year everything will change forever.
144
311
2,586
445,326
Only about 18% of Americans voted for #realdonaldtrump yesterday – the biggest block chose not to vote at all! I just made this infographic:
514
2,427
2,130
We just made the most scientifically complete #NuclearWar simulation to date, from fires to famine. As you can see, about 99% die in the US, Europe, Russia & China. Winner: nobody.
How would a nuclear war between Russia and the U.S. affect you? Who would survive? Watch and share our most realistic nuclear war simulation yet:
188
876
2,163
707,669
My beloved dad died peacefully this morning, after 92 inspiring orbits around the sun, retaining his dark humor and epic stoicism until the very end. I feel so grateful for all his love, wisdom and encouragement, and for getting to do 53 orbits together. ❤️
114
32
2,252
No, LLM's aren't mere stochastic parrots: Llama-2 contains a detailed model of the world, quite literally! We even discover a "longitude neuron".
Do language models have an internal world model? A sense of time? At multiple spatiotemporal scales? In a new paper with @tegmark we provide evidence that they do by finding a literal map of the world inside the activations of Llama-2!
78
339
2,236
800,096
Like @EmmanuelMacron and @ElonMusk, I don't want a world war – and I won't apologize for saying this. If you agree, please write #idontwantaworldwar as a reply to this tweet!
What nuclear war looks like from space based on data from peer-reviewed science papers. A Nature Food paper today suggests that over 98% would starve to death in the US, Europe, China & Russia.
777
358
2,024
Agreed: an unregulated race to the bottom will end badly for the human race.
Dear everyone who wants to regulate and slow down AI: please stop fighting over who has the Most Correct Reason for the slow down. Just work together and make it happen! Reasons in alphabetical order: * autonomous weapons * bias * biosecurity * children's safety (more...)
95
154
1,720
1,879,460
Our new AI paper reveals surprising geometric structure in the LLM-learned concepts: 1) They form brain-like "lobes", 2) they form "semantic crystals" much more precise than it first seems, and 3) the concept cloud is more fractal than round:
1/6 New paper! “The Geometry of Concepts: Sparse Autoencoder Feature Structure.” We find that the concept universe of SAE features has interesting structure at three levels: 1) “atomic” small-scale, 2) “brain” intermediate-scale, and 3) “galaxy” large-scale!
50
370
1,802
272,103
In an international climate treaty hopeless? No! 😃 That's how we mended the #OzoneHole, thanks to the heroes in this video, so we can succeed again with our future climate! #FutureOfLifeAward #Optimism piped.video/watch?v=1iGISskc…
55
142
1,706
To those of you who claiming that smarter-than-human AI is decades away, what *specific tasks* are you willing to bet me that AI won't be able to do within 5 years?
OpenAI just announced ChatGPT's new real-time conversational chat. The model can understand both audio AND video, and can even detect emotion in your voice. This is insane.
530
168
1,674
891,831
It was great fun being back on Lex' podcast and getting to discuss AI risk & hope without having to boil big ideas into tiny soundbites! piped.video/watch?v=VcVfceTs…
130
152
1,461
354,701
With their impeccable safety culture, never letting things get deployed before they’re properly tested, I’m confident that ⁦@Microsoft⁩ is ready to safely handle the smarter-than-human AI that they’re trying to build with ⁦@OpenAI
48
77
599
45,653
2022 intellectual debate climate in a nutshell: - "Nuclear war is bad" - "You're doing Putin's bidding, period!"
183
101
1,358
You know that AI is taking off when...Microsoft plans to power data centers with nuclear reactors: jobs.careers.microsoft.com/g…
78
246
1,415
458,373
It's disgraceful that US & EU policymakers prefer getting advice on how to regulate AI from AI company leaders instead of academic experts like Bengio, Hinton & Russell who lack conflicts of interest: time.com/6288245/openai-eu-l…
82
174
1,304
462,012
Let's not just focus on whether #GPT4 will do more harm or good on the job market, but also on whether its coding skills will hasten the arrival of #superintelligence – for which AI safety researchers have so far failed to discover any safety guarantees: piped.video/watch?v=3Om9ssTm…
84
157
1,293
1,099,887
Please welcome our first class of @VitalikButerin Fellows in AI Existential Safety, and consider collaborating with them to ensure that AI become the best rather than worst thing to happen to humanity! futureoflife.org/team/fellow…
45
200
1,269
I'll miss you, @StephenHawking8! I've lost a long-time collaborator and, above all, a great inspiration, always reminding me of how seemingly insurmountable challenges can be overcome with creativity, willpower and positive attitude: bbc.com/news/uk-43396008
11
183
1,237
Google 2000: “Don’t be evil” Google 2018: “At least don’t apply AI in ways likely to cause overall harm” Google 2025: “F*ck it!”
117
176
1,240
63,107
AI can't reason and this seminal paper proves it:
🤯 MIND-BLOWN! A new paper just SHATTERED everything we thought we knew about AI reasoning! This is paradigm-shifting. A MUST-READ. Full breakdown below 👇 🧵 1/23
189
138
1,238
319,224
I'm excited that our new paper on machine learning for physics was just published in Physical Review Letters. We show how to auto-discover conservation laws from observed data alone using a cool technique for measuring the dimensionality of a data set. journals.aps.org/prl/abstrac…
13
202
1,171
A stunningly broad coalition has come out against Skynet: AI researchers, faith leaders, business pioneers, policymakers, NatSec folks and actors stand together, from Bannon & Beck to Hinton, Wozniak & Prince Harry. We stand together because we want a human future. #KeepTheFutureHuman
322
311
1,202
722,628
When I wrote Life 3.0, I never imagined it wouldn’t be read like this at Burning Man… 🌃 piped.video/FLH4OTkasUQ?feature…
26
12
198
1,172,664
I feel honored and excited to be back to @LexFridman's podcast; he's really deep, and has a unique knack for getting into the most important and exciting questions about tech, life & reality. piped.video/RL4j4KPwNGM
Here's my conversation with Max Tegmark (@tegmark). Our first chat was episode #1 of this podcast. Now he's back! We talk about the intersection of machine learning and physics, and also about how to avoid near-term and long-term existential threats of AI. piped.video/watch?v=RL4j4KPw…
60
86
1,064
You can always trust both tech companies and the government to respect your data privacy:
Big Brother approves: OpenAI hires ex-NSA chief Paul Nakasone to ensure AI benefits humanity. Trust your secrets with the best!
17
10
107
14,651
My mind just got blown...
83
204
1,031
158,552
Google DeepMind CEO @demishassabis⁩ says it’s “insane” to say there’s nothing to worry about with AI:
115
152
952
100,313
Despite all the hype, artificial intelligence will never outsmart humans - and our seminal new paper rigorously proves it: #AI #AGI #superintelligence arxiv.org/pdf/1703.10987.pdf
116
136
893
We can in stop arguing about whether AI can pass the Turing test:
It's over. ChatGPT 4.5 passes the Turing Test.
111
127
893
167,071
Having fun discussing the future of AI
52
28
866
47,269
With our new #AI algorithm, we finally managed to discover new conservation laws that domain experts didn't know about, in both fluid mechanics and atmospheric chemistry: arxiv.org/abs/2305.19525
27
138
842
202,668
Congrats ⁦@geoffreyhinton⁩ and John Hopfield on the 2024 Nobel Prize in physics! 🎉🥳 Your remarkable work has helped demonstrate that intelligence is a physical process - ultimately the most powerful one in our universe.
36
71
872
40,254
What I’ve learned at the Paris AI summit so far:
128
97
839
114,084
Our new AI mechanistic interpretability paper shows that LLMs are surprisingly clever: representing 2-digit numbers on a line is noisy, so they represent them on a generalized helix to get better addition accuracy, seemingly exploiting modular addition digit by digit:
(1/N) LLMs represent numbers on a helix? And use trigonometry to do addition? Answers below 🧵
36
107
835
90,922
I'm excited about our method for machine-learning new physics. It auto-discovers e.g. Neptune & gravitational radiation by detecting energy conservation violation even when the physical laws are unknown: arxiv.org/abs/2106.00026
33
133
791
We just produced this sequel to our Slaughterbots movie, since they're now here for real and the UN is debating whether to #BanSlaughterbots. What do *you* prefer? piped.video/watch?v=9rDo1QxI…
49
125
750
Here's why the @UN Security Council should agree that AI in nuclear command-and-control is a terrible idea when they discuss the topic tomorrow: piped.video/watch?v=w9npWiTO…
56
107
688
134,809
Let's make #AI like biotech, where companies must demonstrate safety, rather than the civilian nuclear industry, where poor safety standards gave us Three Mile Island, Chernobyl, Fukushima and a backlash that crushed the industry: bnnbloomberg.ca/video/push-f…
125
98
684
1,628,308
After 9 years, without giving a reason, Facebook just canceled my account with tens of thousands of followers and countless conversations. Are you next?
123
70
651
Sam, are you conflating de-escalation with capitulation? Do you agree that taunting Putin with tweets of Marilyn Monroe singing happy birthday by his burning bridge is goading him to escalate, which isn't in our interest and should be criticized?
294
20
605
AI accelerationists: Nobody would ever build AI to kill people. DoD: LOL 🤣
We’re making big bets- and we are going to deliver on those bets.  First up: we’re going to field attritable autonomous systems at a scale of multiple thousands, in multiple domains, within the next 18-to-24 months.  Together, we can do this. @NDIAToday #EmergingTechETI
79
112
629
201,970
I’ve been shocked to discover exactly this over the years through personal conversations. It helps explain why some AI researchers aren’t more bothered by human extinction risk: It’s *not* that they find it unlikely, but that they welcome it!
There are a significant number of people in the AI research community who explicitly think humans should be replaced by AI as the natural next step in evolution, and the sooner the better!
106
93
610
167,498
If you're still not concerned about humanity losing control of recursively self-improving AI, see if #DeepLearning Godfather Geoffrey Hinton can persuade you: piped.video/qpoRO378qRY?t=2148
57
97
586
539,302
I like you, Yann, and as a colleague I'd like to encourage you to apologize to @dan_hendricks for attacking him for his religious upbringing. You really debase yourself by stooping so low as to make lame ad hominem attacks. FYI, I'm not from an ultra-religious family, but as you know, I share Dan's concerns about AI causing extinction.
As I have pointed out before, AI doomerism is a kind of apocalyptic cult. Why would its most vocal advocates come from ultra-religious families (that they broke away from because of science)?
62
13
452
227,210
OK, Andrew: I'll bet you $1000 that there will be fewer US programming jobs a year from now than today. Deal?
Some people today are discouraging others from learning programming on the grounds AI will automate it. This advice will be seen as some of the worst career advice ever given. I disagree with the Turing Award and Nobel prize winner who wrote, “It is far more likely that the programming occupation will become extinct [...] than that it will become all-powerful. More and more, computers will program themselves.”​ Statements discouraging people from learning to code are harmful! In the 1960s, when programming moved from punchcards (where a programmer had to laboriously make holes in physical cards to write code character by character) to keyboards with terminals, programming became easier. And that made it a better time than before to begin programming. Yet it was in this era that Nobel laureate Herb Simon wrote the words quoted in the first paragraph. Today’s arguments not to learn to code continue to echo his comment. As coding becomes easier, more people should code, not fewer! Over the past few decades, as programming has moved from assembly language to higher-level languages like C, from desktop to cloud, from raw text editors to IDEs to AI assisted coding where sometimes one barely even looks at the generated code (which some coders recently started to call vibe coding), it is getting easier with each step. I wrote previously that I see tech-savvy people coordinating AI tools to move toward being 10x professionals — individuals who have 10 times the impact of the average person in their field. I am increasingly convinced that the best way for many people to accomplish this is not to be just consumers of AI applications, but to learn enough coding to use AI-assisted coding tools effectively. One question I’m asked most often is what someone should do who is worried about job displacement by AI. My answer is: Learn about AI and take control of it, because one of the most important skills in the future will be the ability to tell a computer exactly what you want, so it can do that for you. Coding (or getting AI to code for you) is a great way to do that. When I was working on the course Generative AI for Everyone and needed to generate AI artwork for the background images, I worked with a collaborator who had studied art history and knew the language of art. He prompted Midjourney with terminology based on the historical style, palette, artist inspiration and so on — using the language of art — to get the result he wanted. I didn’t know this language, and my paltry attempts at prompting could not deliver as effective a result. Similarly, scientists, analysts, marketers, recruiters, and people of a wide range of professions who understand the language of software through their knowledge of coding can tell an LLM or an AI-enabled IDE what they want much more precisely, and get much better results. As these tools are continuing to make coding easier, this is the best time yet to learn to code, to learn the language of software, and learn to make computers do exactly what you want them to do. [Original text: deeplearning.ai/the-batch/is… ]
90
34
632
73,216
You know you’re winning an argument when your opponents shift their attacks from it to your character… 😜
I have a terrible confession to make: I giggled 😊
74
18
616
157,379
I'm excited that my AI book made #BarackObama's 2018 reading list: cnn.com/2018/12/28/politics/…
47
72
598
We build a lie detector for large language models and show that they represent uncontroversially true and false sentences on opposite sides in a linear space. This is IMHO further evidence they they’re not just overhyped “stochastic parrots”:
Do language models know whether statements are true/false? And if so, what's the best way to "read an LLM's mind"? In a new paper with @tegmark, we explore how LLMs represent truth. 1/N
27
79
620
103,468
A Russia-US #NuclearWar would be humanity's dumbest act yet & may kill about 99% in the US, Russia and Europe as seen in our simulation:
96
222
594
104,787
Another ship-jump. We’re obviously closer to figuring out how to build smarter-than-human AI than to figuring out how to control it, yet we keep racing toward the cliff:
Another OpenAI safety researcher has quit: "Honestly I am pretty terrified."
68
84
608
53,250
Only 4% of Americans strongly disagree with the proposed pause on AI more powerful than #GPT4, so loud pause critics linked to big tech aren't representative. Upton Sinclair said "It is difficult to get a man to understand something, when his salary depends on his not understanding it”. today.yougov.com/topics/tech…
124
97
589
182,864
I never in my wildest dreams thought that one of our nerdy AI papers would end up in a meme...
16
31
571
71,961
IMHO, @SnoopDogg now captures the magnitude of what's happening in #AI better than most tech pundits with their financial conflicts of interest – and most policymakers and corporate lobbyists...
26
95
589
76,335
True and deeply messed up. One law for the rich and one for the poor.
Meta illegaly downloaded 80+ terabytes of books from LibGen, Anna's Archive, and Z-library to train their AI models. In 2010, Aaron Swartz downloaded only 70 GBs of articles from JSTOR (0.0875% of Meta). Faced $1 million in fine and 35 years in jail. Took his own life in 2013.
16
72
597
42,800
In my humble opinion, launching fully autonomous AI researchers and letting AI self-improve is a type of digital gain-of-function research that's extremely reckless - what could possibly go wrong?
OpenAI just announced that they aim to develop fully automated AI researchers by March 2028. This forms part of an incredibly dangerous path to develop superintelligence. Here's why. Thread 🧵
101
104
605
68,403
This may be the first time ever that I strongly disagree with your assessment. I expect AGI to quickly lead to ASI and, if you or anyone else has a convincing plan for how not to lose control over it, I challenge you to post it here for public scrutiny. I’ve read and heard many such “plans”, all hand-wavy and based on hopium.
I hereby disavow the talking-point that "no one knows how to steer AGI". I think there are people who know. It's hard to tell who does or doesn't, which means society is taking a big risk by letting people make AGI. But that's what seems to be happening.
74
54
588
119,431
We found that concepts in an AI brain (nerdspeak: “LLM SAE features”) that activate close together in time are also close together in (“activation”) space, akin to lobes in a biological brain:
If you cluster language model features (from SAEs) into two groups based on whether they tend to fire together in the same document, you find two "lobes" of features that also turn out to be geometrically distinct! Math vs. prose features separate in this t-SNE plot...
22
101
582
56,585
The silence about this craziness in both mainstream media and general conversation is deafening:
Google's Chief AGI Scientist Shane Legg: 50% chance of AGI within 3 years, then 5-50% chance of extinction ONE YEAR LATER Russell: "[AI CEOs] are playing Russian Roulette with the entire human race, without our permission." "Why are we letting them to do this?" "I would NOT let someone come into my house and play Russian Roulette with one of my children... would you?" "This impending transition to a world where we are joined by superhuman entities, if that's the biggest event in human history, then it seems that everyone should be involved in the decisions about whether and how this happens. At the moment, the ones making the decisions are the CEOs of the companies, and they have a vested interest. A couple of them have said there's a significant chance, like 10 or 25% chance, of human extinction." (Stuart Russell, btw, wrote the textbook on AI) Reminder 1: In the largest ever survey of 2,778 AI researchers, the average researcher thought there was a 16% chance AI would cause human extinction. Reminder 2: 82% Americans want to stop or slow down AI. (ht @vitrupo for the vid)
61
120
574
81,093
The main risk with advanced #AI isn't malice, but competence: that it accomplishes goals that aren't aligned with ours. Here are some hilarious and cautionary examples of how this can happen accidentally: aiweirdness.com/post/1728947…
23
244
544
New AI breakthrough: in the space of a few days, starting tabula rasa, #AI blows away 3,000 yrs of human Go wisdom: technologyreview.com/s/60914…
34
295
526
I’m struck by how out-of-touch many of my tech colleagues are in their rich nerd echo chamber, unaware that most people are against making humans economically obsolete with AI:
This is surreal to watch: SXSW audiences were booing and screaming at pro-AI videos SXSW is (partially) a tech conference! They lose their shit when it says “AI makes us more human” I repeat: in 1-5 years, if we're still alive, I expect the biggest protests humanity has ever seen Why? Even if AGI goes great overall - and it might! - the amount of disruption will be unprecedented... ...entire careers vanishing in the blink of an eye... ...skills someone spent a lifetime acquiring suddenly useless... ...industries collapsing... ...political turmoil... ...sacred cows getting slaughtered left and right... If you’ve ever tried to introduce a new idea at work you know that disrupting even a SINGLE status quo is often resisted, and AGI will be The Mother Of All Status Quo Disruptors
91
61
552
181,690
As we celebrate the start of our new annual orbit around the Sun, it’s fun to remember that our a Sun in turn orbits our Galaxy every 300 million years - we’ve done about 15 Galactic laps so far 🎉🥳
This is scary, like the sun is a kidnapper taking us to a second location nitter.app/tuidelescribano/status…
43
86
511
179,160
Yann, my position is markedly different: 1. Extreme power concentration must be avoided. 2. Superintelligence is likely to kill us all if anyone builds it before figuring out how to make it safe. 3. Hence *nobody* should be allowed to build it before it can be guaranteed to be safe.
The Doomer's Delusion: 1. AI is likely to kill us all 2. Hence AI must be monopolized by a small number of companies under tight regulatory control. 3. Hence AI systems must have a remote kill switch. 4. Hence foundation model builders must be eternally liable for bad uses of their models and derived versions of it. 5. Hence open source AI must be banned. 6. But open source is popular, so we're just going to say that we are pro open source but also say we need some sort of regulatory agency to oversee it. 7. We're going to scare the hell out of the public and their representatives with prophecies of doom. 8. But we'll make sure to appear much more respectable than the most extreme doomers. 9. To look even more respectable, we'll create a one-person institute to promote AI safety. 10. We'll get insane levels of funding from well-intentioned but clueless billionaires who are scared sh*tless of catastrophe scenarios, got too rich too quickly, have too much time on their hands, but should know better. 11. We'll claim that the majority of prominent scientists agree with us, even though said scientists are an infinitesimal minority in the AI community.
202
56
547
895,503
Here's a fascinatingly well-researched scenario where AI takes over in a few years and then kills all humans. Whatever actually happens will probably feel comparably sci-fi:
"How, exactly, could AI take over by 2027?" Introducing AI 2027: a deeply-researched scenario forecast I wrote alongside @slatestarcodex, @eli_lifland, and @thlarsen
73
106
582
73,508
Humanity is now spending more on its quest to get outsmarted than on all science, art and humanitarian missions combined:
53
85
550
225,788
Fascinating livestreamed senate hearing on AI going on right now: piped.video/watch?v=fP5YdyjT…
36
74
295
82,794
Yann, I'd love to hear you make arguments rather than acronyms. Thanks to @RishiSunak & @vonderleyen for realizing that AI xrisk arguments from Turing, Hinton, Bengio, Russell, Altman, Hassabis & Amodei can't be refuted with snark and corporate lobbying alone.
The UK Prime Minister has caught the Existential Fatalistic Risk from AI Delusion disease (EFRAID). Let's hope he doesn't give it to other heads of state before they get the vaccine. "An AI safety summit at Bletchley Park in November is expected to focus almost entirely on existential risks and how to negate them." telegraph.co.uk/business/202…
49
38
553
471,448
Claude 2 on the left, Claude 3 on the right:
claude 3 opus when anthropic runs tests on it
22
50
551
61,527
"Don't regulate AI – just trust the companies!" Does he also support abolishing the FDA and letting biotech companies sell whatever meds they want without FDA approval, because biotech is too complicated for policymakers to understand?
WATCH: Former Google CEO @ericschmidt tells #MTP Reports the companies developing AI should be the ones to establish industry guardrails — not policy makers. “There’s no way a non-industry person can understand what’s possible.”
86
94
508
159,575
Although the Winograd test for AI is notoriously tougher than the Turing test, just-released GPT4 crushes it:
31
77
541
75,056
Tired of silly terminator stories? My summary of the top 9 myths about advanced future AI: futureoflife.org/background/…
20
354
516
Almost everyone I speak with wants tool AI rather than AGI: controllable AI than empowers us rather than overmarketed “digital god” AI that replaces us and that we don’t know how to control. The key point about the Venn diagram below is they we can get basically all the tools we want as long as we don’t put too much “A”, “G” and “I” into the same system:
🛠️ 🗣️ As @tegmark is discussing at @IASEAIorg, the controllable and beneficial AI tools we want are within our reach - but to get them, we must start treating the AI industry like all other high-impact industries, with legally binding safety standards that incentivize companies to innovate and prioritize public safety. 📄 At the link, read our new Policymakers' Guide to AI - including our proposed AI safety standards: bit.ly/4htdQq4
109
84
544
72,844
Being smarter doesn't necessarily make people more moral
Tonight from the Munk Debate on AI @MelMitchell1 and I are debating Max @tegmark and Yoshua Bengio munkdebates.com/debates/arti…
64
54
510
115,439
I'e very excited about our AI latest paper, which shows that an architecture radically different from standard neural network achieves much better accuracy with fewer parameters for interesting physics and math problems. arXiv:2402.05110
MLPs are so foundational, but are there alternatives? MLPs place activation functions on neurons, but can we instead place (learnable) activation functions on weights? Yes, we KAN! We propose Kolmogorov-Arnold Networks (KAN), which are more accurate and interpretable than MLPs.🧵
26
74
515
61,927
- Top AI CEOs and researchers: “Self-improving AI could take over and cause extinction” - ICLR: “Let’s run a workshop on how to make it happen sooner!”
This accepted ICLR workshop is a bad idea.
92
64
563
85,412
DW just interviewed me about why I think #AI will soon pose an existential threat. I really liked the interviewer, who favors depth over soundbites. piped.video/watch?v=ewvpaXOQ…
44
111
503
67,972
We were right to call for a pause on AGI attempts. Governments need to implement binding safety standards ASAP, because even well-meaning companies clearly can’t resist profit pressure to drive humanity toward the cliff. Now OpenAI’s superalignment leads quit. #PauseAI
Replying to @janleike
We are long overdue in getting incredibly serious about the implications of AGI. We must prioritize preparing for them as best we can. Only then can we ensure AGI benefits all of humanity.
83
84
513
108,807
OK, Yann: predict some concrete tasks that you’ll bet me AI won’t be able to do by 2027! 😃
It seems to me that before "urgently figuring out how to control AI systems much smarter than us" we need to have the beginning of a hint of a design for a system smarter than a house cat. Such a sense of urgency reveals an extremely distorted view of reality. No wonder the more based members of the organization seeked to marginalize the superalignment group. It's as if someone had said in 1925 "we urgently need to figure out how to control aircrafts that can transport hundreds of passengers at near the speed of the sound over the oceans." It would have been difficult to make long-haul passenger jets safe before the turbojet was invented and before any aircraft had crossed the atlantic non-stop. Yet, we can now fly halfway around the world on twin-engine jets in complete safety. It didn't require some sort of magical recipe for safety. It took decades of careful engineering and iterative refinements. The process will be similar for intelligent systems. It will take years for them to get as smart as cats, and more years to get as smart as humans, let alone smarter (don't confuse the superhuman knowledge accumulation and retrieval abilities of current LLMs with actual intelligence). It will take years for them to be deployed and fine-tuned for efficiency and safety as they are made smarter and smarter.
115
30
522
195,921
Our new paper tries to quantify how smarter AI can be controlled by dumber AI and humans via nested "scalable oversight". Our best scenario successfully oversees the smarter AI 52% of the time, and the success rate drops as one approaches AGI. My assessment is that the "Compton constant", the probability that a race to AGI culminates in loss of control of Earth, is >90%.
1/10: In our new paper, we develop scaling laws for scalable oversight: oversight and deception ability predictably scale as a function of LLM intelligence! We quantify scaling in four specific oversight settings and then develop optimal strategies for oversight bootstrapping.
68
106
526
158,349
I'm excited that we were able to give a $50k award to the family of Stanislav #Petrov, thanks to whom today isn't the 35 aniversary of WWIII. futureoflife.org/2018/09/26/…
14
125
465
Great new documentary. People who claim there will still be jobs after superintelligence don't understand what superintelligence means.
60
82
502
45,640
Ignore at your own peril:
The Great AI Job Displacement Is Closer Than You Think
50
82
492
58,608
This has been my favorite present from Santa so far:
5
32
465
France's @Durov arrest "to protect us" reminds me of H L Mencken's quote: "The urge to save humanity is almost always a false front for the urge to rule"
First they came for Tiktok, and I did not speak out—      Because I was not twelve years old. Then they came for the Telegram, and I did not speak out—      Because I was using some other app or sth idk. Then they came for literally every other platform for dissent, and I did not speak out—      because bro how tf could i that's the entire point wake up wake up wa—
40
73
464
43,585
Huge news: A who's who of AI leaders agree that "Mitigating the risk of extinction from AI should be a global priority alongside other societal-scale risks such as pandemics and nuclear war." safe.ai/statement-on-ai-risk
40
95
470
54,466
Differences of opinion, but an IMHO really fun, candid and interesting discussion on AGI risk and hope with @gdb, @elonmusk & @netanyahu
36
40
453
330,575
Got data and wonder if there's a formula describing it? Try our new physics-inspired AI Feynman algorithm, published today. It automates what took Kepler 4 years: advances.sciencemag.org/cont…
8
121
462