Pinned Tweet
i joined prime 12 months ago crazy what you can build in a year with a team like this `prime lab`
The next wave of AI will not be won by better prompts. It will be won by systems that learn from experience. Today, Prime Intellect Lab is out of beta, open for you to start training your own models. The era of self-improving agents is here.
26
20
689
88,384
deepseek is passing the chatgpt detector tests with flying colors
109
1,229
42,844
1,041,756
it's interesting that 1.5 billion parameters is all you need to crush math competitions, but you need like 15 trillion to make the model be funny maybe humor is the right measure of true intelligence
380
1,287
17,558
12,761,942
grinding til 3am isn’t really a flex if you’re waking up at noon every day
170
207
13,072
666,999
Replying to @carmguti
TAke a look, y'all: IMG_4346.jpeg
11
85
11,478
211,329
idk about that, most of my friends know what it is
black pill: pretty sure a majority of the population has no idea what sampling bias is
66
435
11,487
358,742
"don't write or run any code, just explain possible approaches and their tradeoffs"
86
326
7,120
274,329
i have evidence that openai is training on deepseek's user interface
This is live for me now, and for some of my mutuals. GPT-4o uses chain of thought now for every response if you toggle it on.
38
387
6,871
224,630
it’s kind of funny that the “main ai guy” is basically nontechnical
173
179
6,524
940,986
Replying to @theo
45
92
5,234
80,300
my model picker still looks like this lol
423
65
5,123
726,742
i booked some flights today and i really don't know what part of the process could've been made much easier by ai. you just type in cities/dates, see options for times + prices, choose, enter your credit card info, then click buy. which part of that should be automated?
In 2030, what % of flights, hotels, and vacations will be fully booked by AI?
165
92
4,935
520,223
drop the “Deep”. just “Research”. it’s cleaner
Today we’re launching Research, alongside a new Google Workspace integration. Claude now brings together information from your work and the web.
24
123
4,835
232,332
rent is dropping significantly in all of the top 10 cities as ranked by magnitude of rent decrease
Apartment rents are collapsing
97
68
4,910
444,833
which is larger, 52.8 or 69.1?
235
157
4,285
448,472
Deep Research goes so hard if you spend 20 minutes writing your prompt
82
96
4,128
450,270
Sankey diagrams are a really intuitive way to visualize this kind of information flow. it’s a shame what happened to them
Has anyone used Monarch as a personal finance app? I used to use Mint but it shut down & recently I’m getting bombarded with ads showing the pretty Sankey diagrams that Monarch makes. It’s like they know my weaknesses.
64
82
4,094
195,913
which AI division?
*META LOOKING AT DOWNSIZING A.I. DIVISION OVERALL: NYT
20
86
4,052
282,851
every dice roll i’ve ever done has either missed or beaten the expected value
Among every founder Ive ever known or invested in, one thing is true: none of them hit the revenue goal they say they'll do. They all either miss it or beat it. Every time.
55
57
4,034
175,178
you can literally make 6 figures for just having a hobby
hobbies are just excuses for unambitious people to fill their free time
61
134
3,921
423,836
if zuck is throwing around these kinds of numbers why doesn't he just buy anthropic lol
138
46
3,846
314,253
i’ve been significantly more productive since the whole ai thing really took off but it’s been like 10% using ai for stuff and 90% the whole “you have 2 years to escape the permanent underclass” thing
69
149
3,863
246,682
happy saturday for the price of a cup of coffee, you can rent a 4090 for the day and do some random experiments on small models
64
165
3,640
287,900
probably the highest-impact 6 lines of code i've written in my life
55
120
3,480
533,317
LLMs know way too many facts. you can ask Llama 1B about the history of pizza and it gives a decent answer. it shouldn't be able to do that. it should just do a google search. use those weights for something else
157
130
3,407
302,810
it’s kinda hilarious that the quora ceo has been on the board of openai since 2018 and they still missed this hard
quora has been destroyed by LLMs. traffic -33% in just 6 months!
53
76
3,345
178,694
this is how you know someone is cracked. no proofreading whatsoever, resume speaks for itself
32
32
3,188
249,703
anthropic employees should use twitter slightly more, openai employees should use twitter slightly less, xai employees should use twitter slightly differently
81
68
3,218
144,797
xAI Head of Product announces that Grok 4 is “the Antichrist”
Going from an office where AI researchers are building the Antichrist to my living room where my girlfriend is watching Love Island is one of the most drastic transitions in the known universe
62
104
3,103
201,454
imagine if gas stations didn't tell you how many gallons you were getting because car mileage was a trade secret and the gas station owned the car companies and you could either buy way overpriced gas per-mile or a monthly "max gas subscription" that turns off randomly sometimes
We’re rolling out new weekly rate limits for Claude Pro and Max in late August. We estimate they’ll apply to less than 5% of subscribers based on current usage.
94
161
3,085
239,883
"IMO Medalist, Math PhD, or Masters in Data Science"
IMO medalist / Masters or PhD and $45-100/hour lol
27
85
2,874
242,714
now this is a hiring page
43
94
2,752
353,925
I just read this new paper from Google and I’m absolutely buzzing 🤯 The core idea is almost offensively simple: ditch recurrence and convolutions, and use only attention. That’s it. And somehow…it unlocks a whole new regime of performance, scale, and simplicity. Here’s what blew my mind: - No recurrence, full parallelism. Tokens don’t have to march one step at a time anymore. Training lights up the whole sequence at once. Throughput goes way up, iteration cycles shrink. - Multi-head attention = multiple viewpoints. The model learns to focus on different relationships simultaneously. Syntax, semantics, long-range dependencies—captured in parallel. - Positional encodings without the baggage. You still get order awareness, but with zero recurrence overhead. - Encoder–decoder stacks that actually scale. Deep, clean, modular blocks with residual connections and layer norm that just…train. Reliably. - Results that speak for themselves. Stronger quality on translation benchmarks with dramatically better efficiency—and a simpler pipeline. Why this matters (right now): - Speed → strategy. When training is parallel and stable, you iterate faster, test more hypotheses, and ship better models sooner. - Quality → product. Long-range reasoning and richer representations turn into real-world wins: better search, smarter assistants, more robust generative systems. - Simplicity → leverage. Fewer moving parts, clearer abstractions, and a backbone that generalizes across tasks. This is an architectural blueprint, not a one-off trick. What I’m changing this week: - Refactoring any sequence stack I touch toward a Transformer backbone. - Re-thinking compute budgets around parallelism (bigger effective context, larger batches, faster turnaround). - Making attention the first-class citizen in modeling discussions—design defaults, not an afterthought. This paper feels like an inflection point. If you’re building anything with sequences—language, code, planning, you name it—read it, internalize it, and rethink your roadmap. The title isn’t marketing. Attention really is all you need. #AI #MachineLearning #NLP #Transformers #DeepLearning #GoogleAI #Attention #Research #ProductEngineering #Builders
226
143
2,517
320,703
the new llama 4 models are so advanced that they require versions of hf transformers that haven't even been invented yet
11
33
2,541
162,057
remember when yann lecun said this and everyone was like "shut up dummy" and scared him off this website
Why can AIs code for 1h but not 10h? A simple explanation: if there's a 10% chance of error per 10min step (say), the success rate is: 1h: 53% 4h: 8% 10h: 0.002% @tobyordoxford has tested this 'constant error rate' theory and shown it's a good fit for the data chance of success declines exponentially
75
93
2,471
288,512
we’re truly about to enter a golden era of jane street and robinhood rugging the spreads on everyday americans vibe-trading options based on hallucinated GPT slop analysis
working on bringing sell side research from banks to perplexity for everyone. stay tuned.
38
83
2,461
205,245
zuck hired wang for 0.5-1% equity he’s basically a founding engineer
16
34
2,382
279,231
it’s DeepMind > OpenAI > Anthropic > xAI and all of those separations are quite large
which AI research lab is the most innovative?
63
65
2,197
353,102
the most important equity offer you’ll ever get has an 18 year vest with a 9mo cliff
51
53
2,210
176,973
if you’re building a RAG system in 2025 just build a good search engine backend + let the model query it
79
73
2,150
371,476
oh he’s making a content house. he’s gonna have an army of tiktokers selling the product all day while adding features to it. wow. will certainly be interesting to watch.
im hiring 50 interns in san francisco for @trycluely. $50/hr. + founding engineers cluely .com/careers
31
23
2,098
237,183
11k youtube views on this already is kinda wild to me lol thanks for checkin it out :)
40
121
2,034
129,951
“why aren’t they talking about R1 in the newspaper” the average american is using 4o-mini on chatgpt [dot] com without an account and doesn’t know or care about o1
56
40
1,964
121,960
zuck fumbled hard by not answering the question about the person whose advice he values most. like dude you have a wife lol
47
10
1,995
895,622
is it just me or has o1-preview been degrading lately?
54
79
1,960
113,171
already exists
An idea. Something half Bell Labs, half Y Combinator. Rather than funding young people with business ideas, fund young people with research ideas. And only then give them VC funding after their idea works and has potential market value
15
58
1,887
190,408
(yes i am fully aware that these do not work)
5
4
1,859
67,268
he was too early, but he was right. Large Action Model all the way down
97
39
1,929
685,504
been learning a lot about LLMs etc over the past year, organized some of my favorite explainers into a “textbook-shaped” resource guide wish i’d had this at the start, maybe it can useful to others on a similar journey genai-handbook.github.io
34
349
1,886
233,901
RL is hard to explain to people because it doesn't really make any sense without internalizing strong intuitions about everything going on here
68
98
1,873
258,871
we’re hiring for lots of junior roles right now. apprenticeships, basically. can be part-time, remote, international. if you really want to get our attention for these, share a cool env to the hub, <300 LOC. we’ll give full-time offers to top performers after. we can sponsor.
Called it last year. Junior roles down 23%. Senior roles up 14%. Harvard study tracked 285,000 firms. Results: Before AI: 1 senior + 3 juniors = 4 person team After AI: 1 senior + Claude = same output
44
84
1,888
416,000
it's simple, really. GPT-4.1 is o3 without reasoning, and GPT-4.1-mini is o4-mini without reasoning. o4-mini-low is GPT-4.1-mini with just a little bit of reasoning. o1 is 4o with reasoning, o1-mini is 4o-mini with a little bit of reasoning, o3-mini is 4o-mini with reasoning that's like better but not necessarily more, and o4 is GPT-4.5 with reasoning.
105
76
1,825
208,269
update: i joined @primeintellect :) cannot describe how excited i am to be joining such an incredible team and mission. there is a dire shortage of labs who are truly embracing open-source research. it’s hard to get the incentives right. you need a business model where open-sourcing your work is positive-sum; being a GPU marketplace is a really good one. prime intellect has been doing incredible work to advance the frontiers of decentralized training and inference, and it is only the beginning. my own goal is to continue along the directions of my recent projects and musings, but bigger, bolder, more real: advancing open research and infrastructure for agentic RL. towards open-source AGI 🚀
203
36
1,859
213,615
Of course that’s your contention. You’re a first-year AI engineer who learned the hard way that RAG is more than vector search. You just finished reading “Building effective agents” and were convinced to migrate from LangChain to Model Complex Protocol, until next month when
64
109
1,771
138,390
the vast majority of american knowledge workers are not very good at working with LLMs and don’t have a great sense of what tasks are within scope of current capabilities. shocking
we are in end game.
105
88
1,740
179,422
most things that people call “agents” should really just be called pipelines
111
101
1,671
140,748
none of the big labs have shipped anything resembling lifelong memory. they haven't really even shipped RAG. none of the open-source stuff works well enough to be useful without intense fiddling for very constrained workflows. this is the real bottleneck now, not "intelligence"
60
76
1,678
175,853
being good at ML systems helps you run more experiments. being good at ML theory helps you run less experiments
34
96
1,649
92,886
just a minor version bump. booooring
34
49
1,679
183,793
you can just RL a model to play Wordle
34
71
1,664
128,678
karpathy is basically the michael jordan of ai educators. except for, well, you know
35
47
1,592
178,043
i'm increasingly convinced that "transformative ai" is going to look like an abundance of specialized models for everything from drug design to weather sims to robotics to supply chains, not one agent to rule them all. we're going to need a lot more ai researchers
111
104
1,600
113,012
Roblox CEO David Baszucki has said gaming for a living could replace labor in a post-AI world
Robinhood, $HOOD, CEO Vlad Tenev has said investing for a living could replace labor in a post-AI world
58
32
1,499
145,452
today was my last day at @morganstanley. the ML research team there has been a wonderful home for the past 2 years. i’ve learned more than i ever could have imagined about LLMs, markets, responsibility, and how things work in the real world. i’ve made many great friends, and have nothing but positive things to say about the research culture there. it’s a one-of-a-kind lab unlike anything else in finance. but it’s time for a new adventure. will say more very soon about what’s next. incredibly excited to get to work. the arena awaits.
72
7
1,497
121,379
was just in an all-waymo traffic jam
70
23
1,486
103,274
i feel like you shouldn’t call your paper that
Replying to @jiqizhixin
AlphaGo Moment for Model Architecture Discovery Paper: arxiv.org/abs/2507.18074
35
18
1,457
149,030
if you’re doing a PhD in a quantitative discipline you should print this out and hang it above your desk lkozma.net/inequalities_chea…
7
141
1,399
157,559
wow. Step Mom is going to become even more beautiful
1/ Today we’re proud to announce a partnership with @midjourney, to license their aesthetic technology for our future models and products, bringing beauty to billions.
23
22
1,447
130,292
just now learning that all of the cracked ml twitter anons are like 19 years old
39
16
1,381
92,377
TLDR: codex is so good that people kept trying to use it for harder tasks and it didn’t do those as well on those and then people just assumed the model got worse
We promised unprecedented transparency for Codex and to take the reports of degradation seriously, despite seeing incredible growth week over week. Here is our report and what we have found over the last seven days docs.google.com/document/d/1…
38
33
1,444
249,590
lmao
35
15
1,392
92,882
what if they added a file explorer and branch navigation and a merge resolver and debuggers and linters and build systems and markdown rendering and multiplexing and a plugin store
With Gemini CLI's new pseudo-terminal (PTY) support, you can run complex, interactive commands like vim, top, or git rebase -i directly within the CLI without having to exit, keeping everything in context.
40
96
927
117,760
once you understand HTTP, you never see the internet the same way
80
25
1,342
111,835
AI isn't actually intelligent. LLMs aren't reasoning. neural nets aren't thinking. computers don't actually compute. algorithms are fake. circuits are made up. logic is a social construct. math is just an illusion
117
95
1,345
75,411
it would take all of the H100s Nvidia made last year to run 498k copies of R1
44
26
1,328
117,411
the year is 2030. python 5.0 now compiles directly to Rython, borrow-checked, zero-cost, no more gc. pip defaults to uv add, and tokio ships with stdlib. the GIL has been replaced with lifetimes, and del wraps std::mem::forget. we still argue about tabs vs spaces.
Today, we’re announcing the preview release of ty, an extremely fast type checker and language server for Python, written in Rust. In early testing, it's 10x, 50x, even 100x faster than existing type checkers. (We've seen >600x speed-ups over Mypy in some real-world projects.)
44
54
1,348
91,160
this is like if everyone’s LK-99 replication attempts actually worked
12
57
1,236
63,062
benchmark idea take a repo with >10k lines of code + high test coverage delete a random file goal: rewrite the file + pass all tests
52
40
1,234
85,877
sometimes i get linkedin DMs from quant recruiters mentioning that a fund is starting a machine learning team and i’m like what do you mean “starting”
17
8
1,225
68,359
it's kind of a "King of England" situation
24
20
1,238
187,466
i first got into agentic programming about 18 years ago
31
53
1,225
55,877
thoughts on karpathy interview: - agree with him on pretty much everything - literal transformative AGI is not imminent - that's fine + doesn't mean it's a bubble - "normal technology" that improves productivity + growth is very valuable - the market already reflects this
59
40
1,243
197,002
what’re the best relaxing background YouTube video channels?
18
74
1,218
81,111
i miss gemini-2.5-pro-exp-03-25 so bad :(
56
32
1,189
87,031
guy who quits xAI because of “Ani” and joins MSL to work on “Step Mom”
40
24
1,163
111,462
people stopped working on ARC-AGI because they realized it was too hard
41
37
1,151
121,356
coding on your iphone: - acquire mac mini - set up dotfiles, nvim etc - install ssh client on phone - ssh to mini - npm install -g @anthropic-ai/claude-code - scripts/tools for web search, e2b/docker, cli chat - use git branches + CLAUDE.md aggressively - profit?
46
50
1,159
179,343
$2K per month is almost as expensive as an actual PhD student
NEW w/ @coryweinberg: OpenAI is doubling down on its application business. Execs have spoken with investors about three classes of future agent launches, ranging from $2K to $20K/month to do tasks like automating coding and PhD-level research: theinformation.com/articles/…
44
18
1,133
90,804
gen z boss and o3-mini gen z boss and o3-mini
21
44
1,137
42,519
if you're a later-year PhD student thinking about internships for next summer, my team at Morgan Stanley (ML Research) is hiring! we're a tight-knit team of ~20 researchers working on everything from time-series to RL to diffusion to LLMs, both applications and fundamentals. we have compute, publishing freedom, and an endless supply of hard and interesting problems. feel free to reach out to me with any questions!
20
93
1,109
335,724
i’m sure he knows plenty *about* computers and ai and everything but like he dropped out at 19 to be a business guy and has never publicly been an engineer or researcher at any point in his adult life
13
9
1,115
80,583
Replying to @creatine_cycle
it’s called vibe investing
10
11
1,098
23,951
very confused why you’d pick Anthropic of all labs as the one to protest
58
13
1,088
98,634
"why are you so good at posting" i spent the better part of a decade studying algorithmic game theory, recommendation system feedback loops, and information spread in social networks
46
27
1,098
87,967
spending everything on Alexandr Wang
54
27
1,076
118,889
WOW! 🤯 this groundbreaking dataset from Meta’s Chief AI Scientist has revolutionized the way that we understand vision 👀 🚀 is this one of the highest-impact releases of all time?? ⏳🔥 10 crazy examples below: 🧵
97
45
1,090
153,338
one of the following must be true: - anthropic is very efficient at training but terrible at serving inference - sonnet is way larger than V3 - anthropic allocates very few GPUs to inference and serves at 95%+ margins - deepseek out-engineered anthropic for training efficiency
41
24
1,077
86,180
21
40
1,047
137,736
2+ years of MCP experience required
80
57
1,040
69,595
Apple is SO far behind in AI, they’ve barely shipped ANYTHING unrelated, the new iOS and macOS are an absolute pleasure to use, there are so many clever features which smartly integrate info across apps and workflows, things feel automatic and intuitive, can’t imagine switching
81
16
1,025
103,361
sorry to the haters but this is a very compelling sequence of product demos. the big question is they can get a cursor-like market share lead before openai clones it
introducing @cluely. today is the start of a world where you never have to think again. we just killed 9 industries (thread):
66
20
1,016
190,369