OG GenAI Skeptic; spoke at US Senate. Warned about hallucinations in 2001. Advocating world models & neurosymbolic AI ever since. Author, Marcus on AI & 6 books

Three thoughts on what really matters: 1. Fuck cancer 2. Friends are irreplaceable 3. The new "Marcus test" for AI is when AI makes a significant dent on cancer May that happen sooner, much sooner, rather than later. In memory of my childhood friend Paul.
285
156
2,827
593,151
Absolutely the most important satire of the LLM era.
My 3-month-old son is now TWICE as big as when he was born. He's on track to weigh 7.5 trillion pounds by age 10
44
4,036
136,627
3,858,735
I never thought I would live to see the President of the United States serving as Russia’s Press Secretary.
1,311
6,276
25,258
817,228
⚠️⚠️⚠️ I have a PhD in Brain and cognitive science, from MIT. 𝗜𝗻 𝗷𝘂𝘀𝘁 𝘁𝗵𝗲 𝗹𝗮𝘀𝘁 𝟮𝟰 𝗵𝗼𝘂𝗿𝘀, 𝗜 𝗵𝗮𝘃𝗲 𝘀𝗲𝗲𝗻 𝟰 𝘀𝗶𝗴𝗻𝘀 𝗼𝗳 𝗯𝗿𝗮𝗶𝗻 𝗱𝗮𝗺𝗮𝗴𝗲 𝗮𝗻𝗱/𝗼𝗿 𝗱𝗲𝗴𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝗼𝗻 𝗶𝗻 𝗧𝗿𝘂𝗺𝗽: - 𝗠𝗼𝘁𝗼𝗿 𝗱𝗲𝗳𝗶𝗰𝗶𝘁: swaying, missing and and stumbling in front of the garbage truck - 𝗟𝗼𝘀𝘀 𝗼𝗳 𝗰𝗼𝗴𝗲𝗻𝗰𝘆: inappropriate musical pause/swaying, similar to longer incident a few weeks ago - 𝗗𝗶𝘀𝗶𝗻𝗵𝗶𝗯𝗶𝘁𝗶𝗼𝗻: threatening Liz Cheney with violence - 𝗔𝗽𝗵𝗮𝘀𝗶𝗮: description of Musk as “good at computer” Things are degenerating, fast. 𝙒𝙝𝙮 𝙞𝙨𝙣’𝙩 𝙩𝙝𝙚 𝙢𝙚𝙙𝙞𝙖 𝙧𝙚𝙥𝙤𝙧𝙩𝙞𝙣𝙜 𝙤𝙣 𝙩𝙝𝙞𝙨 𝙥𝙖𝙩𝙩𝙚𝙧𝙣?
1,317
4,783
16,044
1,063,940
When I started working on AI four decades ago, it simply didn’t occur to me that one of the biggest use cases would be derivative mimicry, transferring value from artists and other creators to megacorporations, using massive amounts of energy. This is not the AI I dreamed of.
OpenAI just released a model that can generate 1-minute videos. You simply cannot argue that these models don't / won't compete with the content they're trained on, and the human creators behind that content. What is the model trained on? Did the training data providers consent to their work being used? The total lack of info from OpenAI on this doesn't inspire confidence. Across the AI industry, people's work is being exploited without consent to build products that compete with that work. This must be controlled by regulators.
198
2,478
15,435
821,345
When LeCun is right, he’s right
234
2,693
12,276
262,584
Why Musk will sacrifice democracy and maybe even jail, in one convenient chart @nytimes
223
4,315
11,586
826,011
BREAKING: Leaked October 27 letter from OpenAI to White House shows that • OpenAI had begun asking for Federal guarantees over a week ago. • Sam’s long attempted walkback yesterday on X after the huge backlash was a total lie. You cannot trust this man, ever. That’s what Ilya saw.
Here is an OpenAI document submitted one week ago where they advocate for including datacenter spend within the “American manufacturing” umbrella There they specifically advocate For Federal loan guarantees Sam Lied to everyone, Again
Community note
1. This is not a "leak". OpenAI submitted a public interest comment to the White House. The document is published on OpenAI's website. cdn.openai.com/pdf/21b88bb5-1… 2. The document calls for expanding tax credits, and for loan guarantees for *manufacturers*. It does not propose a backstop for OAI's loans
290
2,405
13,299
1,258,596
If you live in the U.S. your civil rights will soon be over. This is not a drill.
1,597
1,931
11,811
507,339
A computer scientist’s perspective on vibe coding:
618
1,253
8,581
832,072
I just wrote a great piece for WIRED predicting that the AI bubble will in collapse in 2025, and now I wish I hadn’t. Clearly, I got the year wrong. It’s going to be days or weeks from now, not months.
260
701
6,969
1,612,596
I retired from the American University system at age 49 (after 26 years) so have no direct stake in this. But am here to say that drastically cutting NIH & NSF funding and reducing grant “overhead” to 15% are among the dumbest ideas I have ever seen. America will never be the same. Science will take a huge hit; academic stars will leave; university reputations will crumble, and homegrown talent will be even harder to find. Asia and Europe will profit. Epic unforced errors.
479
1,613
7,552
620,785
Call your Congress people. Ten months almost to the day, OpenAI and NVidia have both rolled out *exactly* the “too big to fail .. but China” game plan I predicted they would be forced to. Our government is welching on food stamps but they are going to bail out the biggest public and private companies of all time — with your money. Don’t stand for it.
The countdown until we are told that LLMs are “too big to fail” starts now. “We can’t afford to lose to China”, they will say, accepting their multibillion dollar bailouts.
104
1,349
6,784
324,326
Mark my words, this week marks peak bubble. Oracle just went up $300B on what appears to be a non-binding deal with OpenAI. •OpenAI doesn’t have $300 billion dollars •They don’t have anywhere near $300 billion dollars •By their own (presumably optimistic) projection, they won’t turn a profit until 2030. •And all this from a company thought (or claimed) that GPT-5 was going to be tantamount to AGI (spoiler alert: it wasn’t) •For good measure Oracle doesn’t have the chips they would need to fulfill the contracts, or even the cash to buy them. I will be astounded if Oracle meets their projections. Further discussion at the link below.
323
541
6,526
824,420
a big deal: @elonmusk, Y. Bengio, S. Russell, ⁦⁦@tegmark⁩, V. Kraknova, P. Maes, ⁦@Grady_Booch, ⁦@AndrewYang⁩, ⁦@tristanharris⁩ & over 1,000 others, including me, have called for a temporary pause on training systems exceeding GPT-4 futureoflife.org/open-letter…
974
1,692
5,994
7,041,405
Wait what? Murati is leaving, Schulman left, Karpathy left, Ilya left, Leike left, Brockman is on leave, perhaps a dozen others left, GPT-5 hasn’t dropped, Sora hasn’t shipped, the company had an operating loss of $5b last year, there is no obvious moat, Meta is giving away similar software for free,many lawsuits pending, and people are valuing this company at $150 billion dollars? Absolutely insane. Investors shouldn’t be pouring more money at higher valuations, they should be asking what is going on.
I shared the following note with the OpenAI team today.
183
623
5,930
1,909,696
Rough Translation: We won’t get fabulously rich if you don’t let us steal, so please don’t make stealing a crime! Don’t make us pay 𝘭𝘪𝘤𝘦𝘯𝘴𝘪𝘯𝘨 fees, either! Sure Netflix might pay billions a year in licensing fees, but *we* shouldn’t have to! More money for us, moar!
62
1,339
5,361
903,701
Sorry, but this tweet did not age well.
145
113
5,903
905,217
Amazing! But, um, ants have six legs. We are about to have a whole generation of children educated by fake videos that are completely plausible to naive audiences yet biologically incorrect. 🤯
"pov footage of an ant navigating the inside of an ant nest" Video generated by Sora
121
670
5,308
293,286
Speaking of bullshit, anyone remember this?
167
673
4,831
693,968
Black Mirror has arrived, ahead of schedule. An entire cast of deepfaked people tricked a CFO out of $25 million. “(In the) multi-person video conference, it turns out that everyone was fake” Deepfaked shit is getting real.
138
1,194
4,650
1,151,914
Replying to @elonmusk
Straight out of 1984. You couldn’t get Grok to align with your own personal beliefs so you are going to rewrite history to make it conform to your views.
153
288
4,470
166,922
“AI is the asbestos we are shoveling into the walls of our society and our descendants will be digging it out for generations” - @doctorow, speaking truth and taking no prisoners.
31
973
4,588
77,957
Feel the AGI, @sama. No PhD student can match this!
219
219
4,482
697,548
The OpenAI board was right. Even on little things, Sam is not consistently candid. Earlier today OpenAI said the resemblance to ScarJo was a “coincidence”. It was not:
Statement from Scarlett Johansson on the OpenAI situation. Wow:
119
557
4,154
607,746
Hot take on the mess Elon just made, by pushing too far: • Everyone of Elon Musk’s boards now has to consider that he has just flagrantly violated federal law and may go to jail for it. • That is risk to his companies, and because of their fiduciary responsibilities, to his boards. • If anything tanks, lawsuits could be epic. Boards will be asked why they allowed this to go on for so long.
𝙾𝚔 𝚝𝚑𝚒𝚜 𝚝𝚒𝚖𝚎 𝙴𝚕𝚘𝚗 𝙳𝙴𝙵𝙸𝙽𝙸𝚃𝙴𝙻𝚈 𝚟𝚒𝚘𝚕𝚊𝚝𝚎𝚍 𝙵𝚎𝚍𝚎𝚛𝚊𝚕 𝚕𝚊𝚠.
2
1,338
3,024
151,124
GPT-4o hot take: • The speech synthesis is terrific, reminds me of Google Duplex (which never took off). but • If OpenAI had GPT-5, they have would shown it. • They don’t have GPT-5 after 14 months of trying. • The most important figure in the blogpost is attached below. And the most important thing about the figure is that 4o is not a lot different from Turbo, which is not hugely different from 4. • Lots of quirky errors are already being reported, same as ever. (See e.g., examples from @RosenzweigJane and @benjaminjriley.) • OpenAI has presumably pivoted to new features precisely because they don’t know how produce the kind of capability advance that the “exponential improvement” would have predicted. • Most importantly, each day in which there is no GPT-5 level model–from OpenAI or any of their well-financed, well-motivated competitors—is evidence that we may have reached a phase of diminishing returns.
273
366
3,115
1,212,298
Nominee for worst user design choice of the year
86
108
2,985
285,708
BREAKING: Explosive new paper from MIT/Harvard/UChicago. Things just got worse — a lot worse — for LLM’s and the myth that they can understand and reason. The paper documents a pattern they called Potemkins, a kind of reasoning inconsistency (see figure below). They show that LLMs - even models like o3 — make these errors frequently. You can’t possibly create AGI based on machines that cannot keep consistent with their own assertions. You just can’t. “success on benchmarks only demonstrates potemkin understanding: the illusion of understanding driven by answers irreconcilable with how any human would interpret a concept … these failures reflect not just incorrect understanding, but deeper internal incoherence in concept representations” Game over for any hopes of building AGI on a pure LLM substrate. cc @geoffreyhinton, checkmate.
227
577
3,240
417,245
👉Softbank sells entire Nvidia position. 👉Oracle debt downgraded. 👉Meta financing games revealed. 👉OpenAI CEO @sama couldn’t explain how company would meet its $1.4 T obligations. 👉Coreweave drops 20% in a week. You do the math.
163
733
3,165
199,961
OpenAI is in a heap of trouble, and it’s not just text. Long thread why (1/n), based on work with @Rahll
209
564
2,749
1,197,523
A bit over 12 hours later, Trump showed at least 4 more signs of possible dementia today, including more inappropriate foul language (no surprise), more signs of disorientation, and the world’s first effort by a presidential candidate to mimic fellatio in public – yet another remarkable failure to inhibit socially inappropriate responses, confirming my overall impression. Whatever disease he has is likely progressing rapidly.
53
509
2,395
94,202
Have to say I am with LeCun on this one.
61
145
2,416
77,527
SCOOP: OpenAI may lose $5B this year & may run out of cash in 12 months, unless they raise more $, per analysis @theinformation. Investors should ask: What is their moat? Unique tech? What is their route in profitability when Meta is giving away similar tech for free? Do they have a killer app? Will the tech ever be reliable? What is real and what is just demo?
198
488
2,648
964,292
Quote of the day, maybe the year: “*He is not an idiot, Elon Musk is very smart, very tech savvy. He could do a simple Google search of any of the disinformation he has seen in the last week and see it's untrue, but instead he seeks to amplify it and perpetuate it”.
Page One in Edinburgh: “He has vast wealth at his fingertips, and he uses it for some of the most wicked evil I’ve seen.” ⁦@ukpapers$TSLA
63
741
2,053
181,822
losing nearly $4 billion dollars a *month*.
Microsoft’s latest SEC filing quietly exposed that OpenAI lost around $11.5 billion last quarter, based on Microsoft’s 27% ownership stake and a $3.1 billion hit to its own net income. The filings confirm that Microsoft has funded $11.6B of its $13B commitment to OpenAI, with losses directly affecting Microsoft’s earnings under equity accounting rules.
71
591
2,657
143,317
👇Superb new article from @apple AI: “we found no evidence of formal reasoning in language models . Their behavior is better explained by sophisticated pattern matching—so fragile, in fact, that changing names can alter results by ~10%!” 𝗧𝗵𝗲𝗿𝗲 𝗶𝘀 𝗷𝘂𝘀𝘁 𝗻𝗼 𝘄𝗮𝘆 𝗰𝗮𝗻 𝘆𝗼𝘂 𝗯𝘂𝗶𝗹𝗱 𝗿𝗲𝗹𝗶𝗮𝗯𝗹𝗲 𝗮𝗴𝗲𝗻𝘁𝘀 𝗼𝗻 𝘁𝗵𝗶𝘀 𝗳𝗼𝘂𝗻𝗱𝗮𝘁𝗶𝗼𝗻, where changing a word or two in irrelevant ways can give you a different answer. Strongly encourage you to read the whole thread.
1/ Can Large Language Models (LLMs) truly reason? Or are they just sophisticated pattern matchers? In our latest preprint, we explore this key question through a large-scale study of both open-source like Llama, Phi, Gemma, and Mistral and leading closed models, including the recent OpenAI GPT-4o and o1-series. arxiv.org/pdf/2410.05229 Work done with @i_mirzadeh, @KeivanAlizadeh2, Hooman Shahrokhi, Samy Bengio, @OncelTuzel. #LLM #Reasoning #Mathematics #AGI #Research #Apple
96
576
2,311
494,042
What more proof do you need that Elon is living in a bubble than this?
377
97
2,047
151,080
I will always stand in awe of Noam Chomsky. I just sent him my recent essay; 12 minutes later he replied with smart comments, including a point I should have thought to include. He’s 93.
19
119
2,257
Pretending that Elon Musk is “centrist” is the new Flat Earth movement.
187
133
1,909
96,964
I am learning so much from Google’s AI!
106
163
1,981
142,661
Terence Tao reporting on the same problem we have seen endlessly in other domains: LLM’s produce output that “looks” correct but is often deeply wrong and even stupid on careful inspection. I know of no domain in which this is NOT the case. And yet people seem surprised over and over to discover it anew in fresh domains. and non experts often don’t see it at all. No solution to this problem, no AGI. Scaling has not solved it.
Terence Tao says today's AIs pass the eye test -- but fail miserably on the smell test. They generate proofs that look flawless. But the mistakes are subtle, and strangely inhuman. “There's a metaphorical mathematical smell.. it's not clear how to get AI to duplicate that.” Intelligence isn't what looks correct. It's what smells true.
89
281
2,185
246,572
The last few months have been devastating for LLM dreams: • Apple reasoning paper and the ASU mirage paper and many others confirmed that LLMs still can’t solve distribution shift. • GPT-5 came late and fell short. • Turing Award winner Rich Sutton thanked me for my critiques of LLMs and agreed. • Karpathy just said agents aren’t anywhere close, and that AGI is a decade away. • And Hassabis just blew up some wildly overhyped claims from OpenAI about math. Game over, man. LLMs have their place, but anyone expecting the current paradigm to be close to AGI is delusional.
At first, I thought GPT-5 had cracked those math problems on its own. Turns out (as Demis pointed out) GPT-5 just looked up the answers via web search. We really need better peer review for these “AI discovers science/math” claims.
129
234
2,118
359,945
Since @OpenAI still has not changed misleading blog post about "solving the Rubik's cube", I attach detailed analysis, comparing what they say and imply with what they actually did. IMHO most would not be obvious to nonexperts. Please zoom in to read & judge for yourself.
37
666
1,956
𝗪𝗼𝘄! Incredibly damning new study on GenAI from Upwork, via @baldurbjarnason: “Nearly half (47%) of workers using AI say they have no idea how to achieve the productivity gains their employers expect. Over three in four (77%) say AI tools have decreased their productivity and added to their workload in at least one way.” Says Bjarnason, “It’s quite unusual for a study like this on a new office tool, roughly two years after that tool—ChatGPT—exploded into people’s workplaces, to return such a resoundingly negative sentiment.” Many on X may love ChatGPT for work, but much of the outside world feels quite differently. As Bjarnason points out, “it fits with the studies on the actual functionality of said tool: the incredibly common and hard to fix errors, the biases, the general low quality of the output, and the often stated expectation from management that it’s a magic fix for the organisational catastrophe that is the mass layoff fad.”
66
548
1,812
317,824
If all we had was ChatGPT, we could say, hmm “maybe hallucinations are just a bug”, and fantasize that they weren’t hard to fix. If all we had was Gemini, we could say, hmm “maybe hallucinations are just a bug”. If all we had was Mistral, we could say, hmm “maybe hallucinations are just a bug”. If all we had was LLAMA, we could say, hmm “maybe hallucinations are just a bug”. If all we had was Grok, we could say, hmm “maybe hallucinations are just a bug”. Instead we need to wake up and realize that hallucinations are absolutely core to how LLMs work, and that we need new approaches based on different ideas.
247
217
1,858
535,274
OMG! The GenAI honeymoon is definitely over. “A pharma company stopped using Microsoft's Copilot AI tool, with an exec citing high cost and low value.
​ The company used Office 365 Copilot for 500 staff and compared it to middle-school presentations.” -@BusinessInsider
106
336
1,776
288,348
AI backlash reaches new heights
53
277
1,979
48,584
As a graduate of MIT I literally couldn’t watch this to the end; it was too painful. I honestly don’t understand how things got to this point.
"This is the same climate of antisemitism that has led to the massacre of Jews throughout the centuries. This is not just harassment. This is our lives on the line." @MIT student Talia Khan highlights the rise of antisemitism at MIT.
238
141
1,792
423,296
My work here is truly done. Nobody with intellectual integrity can still believe that pure scaling will get us to AGI. GPT-5 may be a moderate quantitative improvement (and it may be cheaper) but it still fails in all the same qualitative ways as its predecessors, on chess, on reasoning, in vision; even sometimes on counting and basic math. Hallucinations linger. Dozens of shots on goal (Grok, Claude, Gemini) etc have invariably faced the same problems. That’s exactly what it means to hit a wall, and exactly the set of obstacles I described in 2022. No hypothesis has ever been given more benefit of the doubt, nor more funding. Pure scaling simply isn’t the path to AGI. Attention was not all we needed. All I am saying, is give neurosymbolic AI with explicit world models a chance. 🫳 🎤
236
250
1,905
230,622
Someday my children will ask me why anyone ever took seriously the thought that systems like this had anything to do with AGI. I honestly have no idea how I will answer. Mass delusion? (Example emailed to me by a friend.)
144
229
1,670
307,847
Remember how Sam Altman told the US Senate he had no “direct” investment in OpenAI? and how they gushed over his apparent selflessness? 👉He didn’t mention that (presumably) he had equity in YC which likely has equity in OpenAI. And now this: 👉“OpenAI Agreed to Buy $51 Million of AI Chips From a Startup Backed by CEO Sam Altman” #candor
140
211
1,868
1,042,856
Prediction: By end of 2024 we will see • 7-10 GPT-4 level models • No massive advance (no GPT-5, or disappointing GPT-5) • Price wars • Very little moat for anyone • No robust solution to hallucinations • Modest lasting corporate adoption • Modest profits, split 7-10 ways
178
197
1,707
665,075
if only @sama had a reputation for keeping to his word, I would be comforted by this. but he doesn’t. and I am not comforted by it, not one bit.. the reality is that yesterday’s statement on government loan guarantees was a trial balloon that a got a ton of flak and this is paragraph after paragraph of spin doctoring, to try to walk that back.
I would like to clarify a few things. First, the obvious one: we do not have or want government guarantees for OpenAI datacenters. We believe that governments should not pick winners or losers, and that taxpayers should not bail out companies that make bad business decisions or otherwise lose in the market. If one company fails, other companies will do good work. What we do think might make sense is governments building (and owning) their own AI infrastructure, but then the upside of that should flow to the government as well. We can imagine a world where governments decide to offtake a lot of computing power and get to decide how to use it, and it may make sense to provide lower cost of capital to do so. Building a strategic national reserve of computing power makes a lot of sense. But this should be for the government’s benefit, not the benefit of private companies. The one area where we have discussed loan guarantees is as part of supporting the buildout of semiconductor fabs in the US, where we and other companies have responded to the government’s call and where we would be happy to help (though we did not formally apply). The basic idea there has been ensuring that the sourcing of the chip supply chain is as American as possible in order to bring jobs and industrialization back to the US, and to enhance the strategic position of the US with an independent supply chain, for the benefit of all American companies. This is of course different from governments guaranteeing private-benefit datacenter buildouts. There are at least 3 “questions behind the question” here that are understandably causing concern. First, “How is OpenAI going to pay for all this infrastructure it is signing up for?” We expect to end this year above $20 billion in annualized revenue run rate and grow to hundreds of billion by 2030. We are looking at commitments of about $1.4 trillion over the next 8 years. Obviously this requires continued revenue growth, and each doubling is a lot of work! But we are feeling good about our prospects there; we are quite excited about our upcoming enterprise offering for example, and there are categories like new consumer devices and robotics that we also expect to be very significant. But there are also new categories we have a hard time putting specifics on like AI that can do scientific discovery, which we will touch on later. We are also looking at ways to more directly sell compute capacity to other companies (and people); we are pretty sure the world is going to need a lot of “AI cloud”, and we are excited to offer this. We may also raise more equity or debt capital in the future. But everything we currently see suggests that the world is going to need a great deal more computing power than what we are already planning for. Second, “Is OpenAI trying to become too big to fail, and should the government pick winners and losers?” Our answer on this is an unequivocal no. If we screw up and can’t fix it, we should fail, and other companies will continue on doing good work and servicing customers. That’s how capitalism works and the ecosystem and economy would be fine. We plan to be a wildly successful company, but if we get it wrong, that’s on us. Our CFO talked about government financing yesterday, and then later clarified her point underscoring that she could have phrased things more clearly. As mentioned above, we think that the US government should have a national strategy for its own AI infrastructure. Tyler Cowen asked me a few weeks ago about the federal government becoming the insurer of last resort for AI, in the sense of risks (like nuclear power) not about overbuild. I said “I do think the government ends up as the insurer of last resort, but I think I mean that in a different way than you mean that, and I don’t expect them to actually be writing the policies in the way that maybe they do for nuclear”. Again, this was in a totally different context than datacenter buildout, and not about bailing out a company. What we were talking about is something going catastrophically wrong—say, a rogue actor using an AI to coordinate a large-scale cyberattack that disrupts critical infrastructure—and how intentional misuse of AI could cause harm at a scale that only the government could deal with. I do not think the government should be writing insurance policies for AI companies. Third, “Why do you need to spend so much now, instead of growing more slowly?”. We are trying to build the infrastructure for a future economy powered by AI, and given everything we see on the horizon in our research program, this is the time to invest to be really scaling up our technology. Massive infrastructure projects take quite awhile to build, so we have to start now. Based on the trends we are seeing of how people are using AI and how much of it they would like to use, we believe the risk to OpenAI of not having enough computing power is more significant and more likely than the risk of having too much. Even today, we and others have to rate limit our products and not offer new features and models because we face such a severe compute constraint. In a world where AI can make important scientific breakthroughs but at the cost of tremendous amounts of computing power, we want to be ready to meet that moment. And we no longer think it’s in the distant future. Our mission requires us to do what we can to not wait many more years to apply AI to hard problems, like contributing to curing deadly diseases, and to bring the benefits of AGI to people as soon as possible. Also, we want a world of abundant and cheap AI. We expect massive demand for this technology, and for it to improve people’s lives in many ways. It is a great privilege to get to be in the arena, and to have the conviction to take a run at building infrastructure at such scale for something so important. This is the bet we are making, and given our vantage point, we feel good about it. But we of course could be wrong, and the market—not the government—will deal with it if we are.
44
149
1,770
117,516
Congratulations, @elonmusk, on Starship’s successful There and Back Again! You will not be forgotten, regardless of what certain myopic ML experts named @Ylecun might have implied. You don’t need to publish to the change the world.
56
80
1,145
80,248
Now we know why Sam Altman went around the world last summer meeting world leaders: his company won’t make it big unless they can convince governments to give them one of the biggest handouts in history.
27
419
1,506
92,348
🎉HUGE news – it means OpenAI and others can no longer hide behind the BS phrase “publicly available”. If they scrape something off the internet that is copyrighted or trademarked they are now obligated to disclose that.
Newsom just signed a new bill requiring Gen AI businesses to publicly disclose their training data 👀
1
410
1,443
81,759
An MIT professor just asked me, how can you be so confident that OpenAI isn’t very close to AGI? Here is a thread of links to a several recent observations that I think show how far we are from robust intelligence. Importantly, all of them are *known* problems, many that I have noted repeatedly for years and even decades. There is no *principled* solution thus far to any of them. 🧵
65
279
1,670
358,660
👉Seismic from @geoffreyhinton: ‘The best hope is for the world’s leading scientists to collaborate on ways of controlling the technology. “I don’t think they should scale this up more until they have understood whether they can control it” nytimes.com/2023/05/01/techn…
76
218
1,527
849,386
The big takeway from tonight: Elon is spending too much time on politics, and not enough on Tesla.
133
92
1,473
85,637
Sam is getting roasted alive. Nobody is buying his bullshit anymore. Glorious.
36
229
1,632
32,463
I literally don’t see how democracy can survive long term if the richest and most powerful people are free to lie at scale with no consequence.
108
249
1,348
131,182
People here are strangely quiet about a private, unelected citizen taking over the U.S. government. A bit like in this cartoon.
65
536
1,530
50,404
Suppose the air conditioning in your car stopped working, and you brought the car into the shop, and the mechanic said, ”I am afraid your car is broken, so I have started dismantling it”. That, in a nutshell, is what Trump and Musk are doing to your country.
I disagree. The education system is broken and needs to be dismantled
95
375
1,565
97,308
God I wish @elonmusk would have taken this bet.
96
79
1,501
96,494
Yann LeCun: AI won’t be used to generate misinformation in science, deliberately or accidentally. It’s not a problem. Actual research: It’s already happening.
2
528
1,305
61,782
I think @elonmusk owes Apple an apology. I won’t say this is perfect (hackers will inevitably find problems), but it seems to me to be quite thoughtfully conceived, and likely is state of the art.
This is from Apple's State of the Union The local model is a 3B parameter SLM that uses adapters trained for each specific feature. Diffusion model does the same thing, adapter for each style. Anything running locally or Apple's Secure Cloud is an Apple model, not OpenAI.
137
120
1,444
310,678
Replying to @SemanticEntity
People pay for samples, and consider stealing to be … stealing
5
14
1,400
31,400
Crikey! “AI worse than humans in every way at summarizing information … the technology might actually make more work for people, not less” @crikey_news
52
316
1,294
117,012
Musicians and Artists, does this sound familiar? From one musician I know:
34
450
1,352
198,904
Woker-than-thou
97
104
1,449
574,200
The number of people simply unable to imagine that the Board acted in a good faith with a legitimate and serious concern boggles my mind. We don’t of course know what happened, but simply dismissing that possibility seems foolish to me, especially given that they were not operating out of financial self interest, unlike all the other actors here.
137
103
1,444
219,167
Never thought I would do this, but I have unfollowed Elon. Reading his relentless distortion is ruining my experience here.
283
58
1,276
103,299
The countdown until we are told that LLMs are “too big to fail” starts now. “We can’t afford to lose to China”, they will say, accepting their multibillion dollar bailouts.
49
264
1,971
614,025
I would very much welcome the thoughts of anyone specializing in neurology. What’s best diagnosis here? Prognosis?
59
94
1,189
93,352
BREAKING NEWS: New @betterup/Stanford study on absolutely nails part of why the ROI on GenAI has been so poor: “Employees are using AI tools to create low-effort, passable looking work that ends up creating more work for their coworkers. On social media, which is increasingly clogged with low-quality AI-generated posts, this content is often referred to as “AI slop.” In the context of work, we refer to this phenomenon as “workslop.”… AI generated work content that masquerades as good work, but lacks the substance to meaningfully advance a given task.”
67
280
1,490
113,688
𝗔𝘀𝘁𝗼𝗻𝗶𝘀𝗵𝗶𝗻𝗴𝗹𝘆 𝗳𝗲𝘄 𝗽𝗲𝗼𝗽𝗹𝗲 𝘀𝗲𝗲𝗺𝘀 𝘁𝗼 𝗵𝗮𝘃𝗲 𝗮𝗽𝗽𝗿𝗲𝗰𝗶𝗮𝘁𝗲𝗱 𝘁𝗵𝗲 𝘁𝗿𝘂𝗲 𝘀𝗶𝗴𝗻𝗶𝗳𝗶𝗰𝗮𝗻𝗰𝗲 𝗼𝗳 𝘁𝗵𝗲 𝗻𝗲𝘄 𝗔𝗽𝗽𝗹𝗲 𝗿𝗲𝗮𝘀𝗼𝗻𝗶𝗻𝗴 𝗽𝗮𝗽𝗲𝗿. I will break it down for you, in four sentences: • It is not just that these systems are doing more pattern recognition than reasoning. • It is that LRM “reasoning models” 𝙘𝙖𝙣 𝙣𝙚𝙫𝙚𝙧 𝙗𝙚 𝙧𝙚𝙡𝙞𝙚𝙙 𝙤𝙣 𝙩𝙤 𝙥𝙧𝙤𝙥𝙚𝙧𝙡𝙮 𝙚𝙭𝙚𝙘𝙪𝙩𝙚 𝙖𝙡𝙜𝙤𝙧𝙞𝙩𝙝𝙢𝙨. • You can’t have reliable AGI without the reliable execution of algorithms. • And we will *never* get to alignment without them.
101
235
1,428
175,312
The fun never ends
115
189
1,318
201,832
⚠️ Everybody who uses LLMs should read this thread: “LLM users consistently underperformed” on multiple cognitive and neural measures.
It’s a hefty 206-page research paper, and the findings are concerning. "LLM users consistently underperformed at neural, linguistic, and behavioral levels" This study finds LLM dependence weakens the writer’s own neural and linguistic fingerprints. 🤔🤔 Relying only on EEG, text mining, and a cross-over session, the authors show that keeping some AI-free practice time protects memory circuits and encourages richer language even when a tool is later reintroduced.
62
255
1,380
124,924
1. OpenAI is a train wreck. 2. They can’t pay the obligations they are taking on and they know it. 3. Guess what? You, the taxpayer, will be left holding the bag,
86
311
1,391
63,459
Could this be the week? We all know the bubble is going to pop; the only question is when.
93
159
1,367
302,108
⚠️ Something terrifying just snapped in place: • Sam Altman has acknowledged wanting to train on everyone's personal documents (Word files, email etc) • ChatGPT has gathered unprecedented amounts of personal data • Sam founded WorldCoin, known for their eye-scanning orb • OpenAI recently put Nakasone (ex NSA) on the board • OpenAI just put in money in a $60M fundraise with a Webcam company and is planning hardware joint venture with them Altman wants to know - and monetize - everything about you. He might succeed.
46
197
633
114,666
Common view; almost certainly wrong. The board’s decision 𝘸𝘢𝘴𝘯’𝘵 a “𝖿𝗎𝖼𝗄 𝗎𝗉”. Instead, it was almost certainly a Hail Mary: 👉This Board’s sole job was to look out for humanity - NOT to protect the brand. 👉They must have seen danger in something Sam was doing 👉They must have known that everyone in the world would be watching, and that their decision would not be popular. 👉They proceeded anyway, despite likely personal costs, and despite knowing they might not succeed, because they felt there was no other option. Even now, their sole goal seems to be make sure that their successors protect the mission. Respect. 💪
But something here really doesn't make sense. It seems like a huge and pointless fuck up by the board and they seem more capable than that
119
150
1,295
2,548,236
Folks, game over. I won. GPT is hitting a period of diminishing returns, just like I said it would.
news: OpenAI's upcomning Orion model shows how GPT improvements are slowing down It's prompting OpenAI to bake in reasoning and other tweaks after the initial model training phase.
118
145
1,256
359,354
Horseshit. Every business in the world has discovered in the last several months that GenAI is not in fact smart enough replace most of their employees. Whatever you are reading from these (often gamed, sometimes contaminated) benchmarks does not reflect real-world reality.
The top AI model is now smarter than 85% of humans. By the end of 2026 it will be smarter than 99.9% of humans. And you think you will still have a job?
66
170
1,319
145,436
I don’t think people quite realize the significance of this moment, so I will spell it out: OpenAI has been bluffing all along, and their CTO just gave away the fact that they are not holding aces. 75% chance that the GenAI bubble will have burst by 12 months from today.
Wow! evidence from OpenAI itself in favor of my conjecture of diminishing returns.
94
229
1,143
248,774
⚠️⚠️⚠️ Everyone—and not just @theinformation—should be genuinely terrified that the richest man in the world has built a Large Language Model that spouts propaganda in his image. This isn’t free speech. It’s a massive, massive thumb on the scale, from one of the most powerful yet unelected people in the world. If that doesn’t scare you, you aren’t paying attention.
Grok 3 is so based 😂
113
169
1,286
111,140
Fantastic essay on AI and art, today in The New Yorker
48
146
1,106
56,616
what almost every amateur who has been sold bullshit about vibe coding is eventually going to encounter
135
176
1,244
242,197
We are watching Elon meltdown in real-time. If his friends cared for him, truly cared for him, they would intervene.
225
51
997
262,891
Replying to @EdKrassen
NBC interview was pre Ukraine invasion. I don’t think it is appropriate to give him a platform now.
46
39
982
74,171
Appalled that @timnitgebru, AI ethics icon, can’t lay off a lovely 87-year-old man, one of the great scientists of our time, whose journalist son was kidnapped and publicly beheaded in an act of political violence. Her power has gone to her head; her humanity has vanished.
165
99
1,165
786,376
A lot of people are missing the funny part. Let me break it down. If OpenAI has actually achieved AGI, they get their software back from Microsoft. Elon’s lawsuit has put them in a position having to prove that they *haven’t* reached AGI, even though OpenAI likes to hint that they have or are close. So they need an expert witness to testify e.g., that their partner Microsoft was full of it when they said that GPT-4 represented “Sparks of AGI”. Which means hiring someone like *me* to save them. 🤣🤣🤣
86
101
1,150
304,680
Sam, I don’t take issue with OpenAI’s achievements, I take issue with the hype and a history of misleading statements (some detailed below). You’ve built something impressive and in many ways groundbreaking. I would be singing OpenAI's praises – it's amazing how you scaled Transformers - if it weren’t for the hype, much of it fanned by you, that has massively overstated what the technology can do today and in the near future, going back to the Rubik's cube demonstration that misrepresented what you had done. I think this hype is harming the world. If I seem frustrated, it’s because of the damage and the wasted potential to the field to which I’ve dedicated most of my life. My only commitment is to the fact that pure scaling won't take us to AGI. The Apple paper everyone is talking about is only the latest of many studies to reinforce that prediction. (I personally have done research for 30 years, published in leading journals, that foreshadowed it. So have many others.) At the same time I have lost faith in you. I stood next to you as you tricked the Senate by saying you had no equity in OpenAI when you had indirect equity in OpenAI via YC. You told the Senate you favored serious AI regulation, and now you don't. You told the Senate you supported compensation for artists, and now you have backpedaled. I told the Senate at the time that I thought you were sincere; in hindsight, you had me fooled. You tricked the world into thinking OpenAI would be Open, when it is not. That it would exist for the benefit of humanity, when mostly it hasn’t. You keep hinting that AGI is close, and that you know how to build it, and to surpass it, but you don’t really have it. (You haven’t even delivered GPT-5 despite many hints.) All of your talk of solving physics has no relation to current reality. Practical robots for most applications (like hoe humanoid robots) are not going to be here in 2027, as almost any roboticist would tell you. And where is the AI safety and alignment plan (and 20% of budget to those things) that you promised? Reports from @_KarenHao and @keachhagey raise all kinds of other questions. Don’t insult me – dare to debate the intellectual substance here, with a neutral moderator in public. (Many major media have offered to host). Is AGI actually close, or not? Is all that talk hype, or are there inherent , lingering problems that scaling hasn't managed to rectify? Can LLMs and LRMs take us to AGI, or do we need other approaches, such as neurosymbolic AI (as I have long argued)? I am game anytime, are you? -- Gary
91
119
1,207
140,124
Replying to @OpenAI
Dear @openAI This is baloney: “We are dedicated to the OpenAI mission and have pursued it every step of the way.” The original mission was to be “unconstrained by a need to generate a financial return”, “not organized for … private gain”, “seek[ing] to open source technology where applicable.” Now Microsoft has an exclusive, 49% of your profits go them, and rather than helping all humanity you are stealing from artists, writers, etc to increase your profits. For good measure you have almost entirely abandoned open-source, and are among the least open AI companies anywhere. Whatever your history with @elonmusk may be, it is not honest to claim that the mission has remained unchanged. You brought receipts; here are mine, both written by you in 2015, one for the public, the other a legal filing. - @garymarcus
49
119
1,112
200,891
So many people are confused about the relation between human cognitive errors and LLM hallucinations that I wrote this short explainer: Humans say things that aren't true for many different reasons • Sometimes they lie • Sometimes they misremember things • Sometimes they fail to think through what they are saying • Sometimes they are on drugs • Sometimes they suffer from mental disorders etc LLMs errors result from 𝙖 𝙙𝙞𝙛𝙛𝙚𝙧𝙚𝙣𝙩 𝙪𝙣𝙙𝙚𝙧𝙡𝙮𝙞𝙣𝙜 𝙥𝙧𝙤𝙘𝙚𝙨𝙨. They don't have (e.g.,) intentions, egos, or financial interests, so they don't lie. They don't take drugs. They don't have emotional states. Instead, LLM "hallucinations" arise, regularly, because (a) they literally don't know the difference between truth and falsehood, (b) they don't have reliably reasoning processes to guarantee that their inferences are correct and (c) they are incapable of fact-checking their own work. Instead, everything that LLMs say -- true or false -- comes from the same process of statistically reconstructing what words are likely in some context. They NEVER fact-check what they say. Some of it is true; some is false. But even with perfect data, the stochastic reconstructive process would still produce some errors. The very process that LLMs use to generalize also creates hallucinations. (In my 2001 book I explain what a different generalization process might look like.) § Importantly, the goal of AGI is not to recreate humans; we don't want AGI to lie or suffer from psychiatric disorders, for example. Rather, the goal of AGI should be to build machines that can reliably reason and plan about a wide swathe of the world. The fact that humans sometimes make errors, sometimes deliberately, sometimes accidentally, in no way takes away from -- or repairs -- the limitations of the current approach. The field of AI will eventually do better, but probably with an AI that is structured differently, in which facts are first-class citizens, rather than something you hope you might get for free with enough data. TL;DR: Don't console yourself with making something that superficially looks like human errors, if you aspire to AGI.
106
300
1,215
263,032
WOW! It’s about time somebody looked into this! Potentially enormous. Breaking @theinformation
36
212
1,001
147,465
𝗜𝗡𝗖𝗥𝗘𝗗𝗜𝗕𝗟𝗘 𝗕𝗥𝗘𝗔𝗞𝗜𝗡𝗚 𝗡𝗘𝗪𝗦 𝗿𝗲 𝗘𝗹𝗼𝗻 𝗠𝘂𝘀𝗸 — and why he may soon wind up in jail. TLDR: Elon’s own lawyers just screwed up, really badly. Longer explanation, background first: • Musk ran a lottery in PA to try to get MAGA fans to register to vote. That’s very likely a violation of federal election law [TBD]. But, separately, it also flouts PA lottery law. • Philadelphia DA brought him to court for it. • In response — and this is the insane part — Musk’s lawyers tried to weasel, alleging in court that “There is no prize” (just people being hired to do ads) and that the winners “are not chosen at random”. 𝗦𝗹𝗶𝗰𝗸 𝗺𝗼𝘃𝗲 - 𝗶𝗳 𝗶𝘁 𝘄𝗲𝗿𝗲 𝘁𝗿𝘂𝗲. 𝗕𝘂𝘁 𝗶𝘁’𝘀 𝗰𝗼𝗺𝗽𝗹𝗲𝘁𝗲 𝗕𝗦. How do we know this? • Because when Elon announced the October lottery — publicly — 𝘩𝘦 𝘩𝘪𝘮𝘴𝘦𝘭𝘧 𝘪𝘯 𝘧𝘢𝘤𝘵 𝘴𝘢𝘪𝘥 𝘵𝘩𝘢𝘵 𝘵𝘩𝘦 𝘱𝘳𝘪𝘻𝘦𝘴 𝘈𝘙𝘌 𝘤𝘩𝘰𝘴𝘦𝘯 𝘢𝘵 𝘳𝘢𝘯𝘥𝘰𝘮. Oops! (“We want to try to get over a million, maybe 2 million voters in the battleground states to sign the petition in support of the First and Second Amendment. … We are going to be awarding $1 million randomly to people who have signed the petition”) Open and shut case. Judges don’t like to be flagrantly lied to and treated like they are morons. They tend to punish people for doing that. And that’s how Elon may in fact wind up in jail. What a colossal own-goal. 𝙁𝙤𝙧 𝙜𝙤𝙤𝙙 𝙢𝙚𝙖𝙨𝙪𝙧𝙚, 𝙩𝙝𝙞𝙨 𝙞𝙨 𝙋𝘼 𝙡𝙖𝙬, 𝙨𝙤 𝙚𝙫𝙚𝙣 𝙞𝙛 𝙏𝙧𝙪𝙢𝙥 𝙬𝙞𝙣𝙨, 𝙝𝙚 𝙘𝙖𝙣’𝙩 𝙥𝙖𝙧𝙙𝙤𝙣 𝙈𝙪𝙨𝙠 𝙛𝙤𝙧 𝙩𝙝𝙞𝙨 𝙤𝙣𝙚.
from Philly courtroom — Musk lawyer says his $1m giveaway is NOT a lottery. "There is no prize to be won" and winners "are not chosen at random," Musk lawyer Chris Gober said. Instead, the $1m is a salary they "earn" to be a spokesperson for Musk's pro-Trump super PAC. Winners are picked based on their "suitability to serve" and their personal story, Gober said. In response, Philly DA team called this a "complete admission of liability" that Musk is running an illegal lottery under PA law. Remember: When Musk announced the giveaway.... in PA nonetheless.... he said, "we are going to be awarding $1 million randomly." DA showed this in court to the judge. Gober argued there's a difference between "randomly" and "by chance," which is why he argued that this isn't an illegal lottery under PA gaming laws.
72
229
958
155,779