Make AI safe again

San Francisco, CA
GCP: We don't need support, our product is too good AWS: We don't need product, our support is too good Azure: We don't need product or support, our sales is too good
3
51
459
We just published our paper on GPT-3! arxiv.org/abs/2005.14165 Proud to be part of this awesome team!
5
77
451
Long time coming, but much more to come!
Claude Sonnet 4 now supports 1 million tokens of context on the Anthropic API—a 5x increase. Process over 75,000 lines of code or hundreds of documents in a single request.
12
11
396
40,722
Proud to share what my team and I have been working on! I use Claude Code almost every day, and wouldn't want to code without it. Just getting started.
Replying to @AnthropicAI
Claude Code has become indispensable for our team. In early testing, Claude completed tasks in a single pass that would normally take 45+ minutes of manual work. Join the limited preview: docs.anthropic.com/en/docs/a…
5
5
133
11,552
I had a great chat with @lennysan ! We talked about everything from the future of AI and the potential for superintelligence by 2028, to the critical importance of AI safety and why we left OpenAI to start Anthropic. If you're curious about what a world with superintelligence could look like and how we're working to make it safe, give it a listen! Listen to the full episode here: piped.video/WWoyWNhx2XU?si=WE1n…
11
7
129
19,153
Excited to announce what we've been working on the last few months: @AnthropicAI! We're looking for aligned researchers and engineers to help build reliable, steerable, powerful AI systems. Ping me if interested, or apply on our site.
2
10
85
This research has been a long time in the making. Super proud of the team! Understanding how language models work mechanistically is the first step to being able to modify that behavior at runtime at a semantic level. This will be a game changer in ensuring AGI is safe.
New Anthropic research paper: Scaling Monosemanticity. The first ever detailed look inside a leading large language model. Read the blog post here: anthropic.com/research/mappi…
3
54
4,412
I've been using versions of Opus 4 for coding for a bit now. The numbers in the chart don't do it justice. Absolute game changer in staying on track, writing maintainable code, and generally doing what I want and expect rather than reward hacking. A+ job, team!
Introducing the next generation: Claude Opus 4 and Claude Sonnet 4. Claude Opus 4 is our most powerful model yet, and the world’s best coding model. Claude Sonnet 4 is a significant upgrade from its predecessor, delivering superior coding and reasoning.
1
2
28
2,718
Love to see this!! Since we first prototyped Claude (and before it was named that), one of the most surprising and impressive things was how empathetic it seemed to be, and that's only improved over time. Try it and see ❤️
New Anthropic Research: How people use Claude for emotional support. From millions of anonymized conversations, we studied how adults use AI for emotional and personal needs—from navigating loneliness and relationships to asking existential questions.
4
2
23
3,454
Had a great chat with @NoPriorsPod @eladgil @saranormous !
New episode drop: @8enmann, cofounder @AnthropicAI - Claude 4 launch - what happens when you act like tokens are free - MCP - specialization and the future of models - safety for agents - economic turing tests
1
21
8,394
This is the real game changer. Anyone can search the web, but searching your own data is a pain. Claude's really good at it.
Replying to @AnthropicAI
Claude can also now connect with your Gmail, Google Calendar, and Docs. It understands your context and can pull information from exactly where you need it.
1
16
2,174
Replying to @tommycollison
CRISPR, according to my friend who works on it at Synthego; commercial rocket flight; self driving cars' impact on real estate prices; many more
16
I've been using our internal prototypes of our MCP-enabled research workflow. It's amazing what Claude digs up. No more manual context stuffing! Hook up your data and give it a shot 🪏🪏🪏
Today we're announcing Integrations, a new way to connect your apps and tools to Claude. We're also expanding Claude's Research capabilities with an advanced mode that searches the web, your Google Workspace, and now your Integrations too.
1
1
15
2,222
Most underrated library imo
The Claude Code SDK now supports custom tools and hooks directly in code. Additionally, we’ve refreshed all our docs with complete references and 10 new guides on how to utilize the SDK.
2
14
2,833
It's been great working with you @henrythe9ths ! Let's get some more founders in here 🚀
4
1
15
3,107
Which IDE do you use as your daily driver?
39% VSCode
39% Cursor
4% Jetbrains
18% Other
28 votes • Final results
1
7
447
I'm one of the first authors of GPT-3 and a founding member of Anthropic. You can definitely have impact now, DM me if you want to chat.
8
284
Claude gets you
7
2,071
Replying to @BlancheMinerva
Deep in Gopher's appendix arxiv.org/abs/2112.11446
1
6
Replying to @jackclarkSF
GPT-2 is why I went back to OpenAI after being laid off 2 years before. Then ended up building GPT-3 🎉
1
6
1,038
Replying to @sdand @AnthropicAI
Pricing is the same as our normal models cdn2.assets-servd.host/anthr…
5
111
Replying to @jasoncbenn
Anyone eh? 😉
1
5
Replying to @MasakhaneNLP
Update coming soon! Thanks for the feedback
3
Replying to @AmandaAskell
sleep & exercise
5
Replying to @amasad
Well deserved! Would love to see log scale for y axis
1
235
I just published “Why AI research may be accelerating faster than experts realize” medium.com/p/why-ai-research…
3
I just published “How I made this impermanent wall art” medium.com/p/how-i-made-this…
3
Replying to @Plinz
Working on it!
In "Language Models (Mostly) Know What They Know", we show that language models can evaluate whether what they say is true, and predict ahead of time whether they'll be able to answer questions correctly. arxiv.org/abs/2207.05221
1
3
I cut a circle of of an n95 and taped it over the valve. Best of both!
1
3
I just published “The best dark chocolate 🍫” medium.com/p/the-best-dark-c…
3
All of section 4 and an additional appendix discuss this issue in the paper. Generally we found it to matter for a few datasets, which we marked with asterisks in our results section.
2
I just published “A game console for the internet” medium.com/p/a-game-console-…
1
2
I just published “How I lost 20 lbs in 3 months” medium.com/p/how-i-lost-20-l…
2
I just published “What Her teaches us about being human” medium.com/p/what-her-teache…
2
I just published “What I learned at a 10 day meditation retreat” medium.com/p/what-i-learned-…
2
Replying to @gwern
I'll be around if you'd like to chat!
1
2
I just published “The best chocolate ice cream brand” medium.com/p/the-best-chocol…
1
2
Replying to @jachiam0
Ironic ad placement
2
5 friends tried to draw a bike from memory after reading arxiv.org/abs/2006.06666 Mine is the top left.
2
Replying to @benkuhn
"The talent density will increase until morale improves"
2
56
I just published “9 questions to meditate on” medium.com/p/9-questions-to-…
1
2
Replying to @noahmacca
- Anomaly detection (blood?) - self cleaning or at least "clean me" alerts so it always gets a good image - frequency would be a challenge if you don't always use that toilet. Some way to log manually - real time courtesy flush rec 😉
1
57
Replying to @karinanguyen
Actually, success is everything other than growth. Growth is the necessary evil you tolerate in order to continue to scale with the needs of the business. For my teams in particular, maintaining and innovating on a service used by millions of people just isn't possible w 4 eng.
2
326
Replying to @erijohnt
Lizardman constant for pricing experiment
2
101
Replying to @tommycollison
Survivor bias
2
Replying to @jackclarkSF
Actually, today's AI systems already consume human attention and interaction eg YouTube, FB, etc content recommendation systems
1
2
I just published “How to decide what your team should do” medium.com/p/how-to-decide-w…
2
I just published “Moonlight” medium.com/p/moonlight-19a8e…
1
Fixed! Thanks for the report
1
1
31
I just published "How to run for people who hate running" benjmann.net/how-to-run-for-…
1
I'm also doubtful of BCI/humans catching up. Timelines are too short, FDA too restrictive, wetlabs too hard. If anyone uses recursive self improvement with unlimited capital then we're probably screwed. Otherwise relying on alignment techniques is my top choice
1
I just published “Who are your real friends?” medium.com/p/who-are-your-re…
1
Replying to @yaroslavvb
IIUC, when using a good reward model, argmax actually works fine arxiv.org/abs/2009.01325
1
1
Replying to @jackclarkSF
But much of that will likely be human curated?
1
1
I think her question was specifically about startups like Starcity, rather than houses that never meant to franchise
1
1
Prompt engineering is a hiccup in the development of good tech, unless you also consider the skill of asking a well meaning human for what you want prompt engineering.
1
Replying to @karpathy
Tampopo
1
Replying to @Steve_Yegge
Great feedback! Just getting started here. It does support multimodal input - you can either paste an image right into the TUI or you can reference an image path on disk.
1
317
Replying to @arram
Slightly less ambitious version: verily.com/solutions/debug/
1
Replying to @Ben_Reinhardt
The Elephant in the Brain by @robinhanson ?
1
I just published “What I learned in physical therapy” medium.com/p/what-i-learned-…
1
Replying to @kandouss
My guess is the paint wouldn't survive the explosion. Air is gentle
1
107
Replying to @mortenjust
Google Photos
1
Replying to @prafdhar
This is like pokefusion for people pokemon.alexonsager.net/
1
I agree that tweet is too glib, but even the other tweets in the series are more nuanced
Replying to @AnthropicAI
This means that if we have a target behavior (e.g. non-discrimination) we may be able to nudge models to achieve that target using IF/CoT prompting if RLHF alone is not sufficient. But we must be careful to check whether RLHF + prompting causes the models to overshoot the target.
1
1
196
Replying to @kandouss
Seems like a big if, ie the human actors are not contributing anything of unique substance?
1
1
168
Replying to @tayroga
"Love of water" is always at the top of my list for eng hiring
1
Replying to @RichardMCNgo
Cal Newport makes this argument well with more nuance in "So Good They Can't Ignore You", where instead he says "gain rare and valuable skills, and passion will follow"
1
1
I just published “When eating a burrito is a terrible idea” medium.com/p/when-eating-a-b…
1
Replying to @andy_l_jones
Why burn to the ground when you can cut it down and sell the wood to be built into structures for extra sequestration?
1
1
I just published “How to do everything wrong on your first solo backpacking trip” medium.com/p/how-to-do-every…
1
Replying to @shariq
You can apply the same argument to productivity/procrastination, hence complice.co/
1