Charles Foster · May 22, 2023 · 5:09 PM UTC

Charles Foster

Pinned Tweet

Charles Foster @CFGeek

22 May 2023

Running list of conjectures about neural networks 📜:

172

41,554

Charles Foster · Feb 28, 2025 · 5:56 AM UTC

Charles Foster @CFGeek

28 Feb 2025

If you’re bored with today’s neural net architectures, then my advice to you is to start training models to use an external memory, now that RL is finally working.

739

60,397

Charles Foster · Mar 27, 2023 · 8:07 PM UTC

Charles Foster @CFGeek

27 Mar 2023

The normalization scheme that DeepMind researchers came up with for their "linear recurrent unit" (LRU) is a nice example of how it is possible to predictably engineer circuits in artificial neural networks, when you know what you're doing. A thread:

Image from the preprint "Resurrecting Recurrent Neural Networks for Long Sequences", with normalization term highlighted and question marks next to it

ALT Image from the preprint "Resurrecting Recurrent Neural Networks for Long Sequences", with normalization term highlighted and question marks next to it

646

151,038

Charles Foster · Oct 19, 2025 · 11:02 PM UTC

Charles Foster @CFGeek

19 Oct 2025

Just read their paper. Looks like they re-invented an existing method known as context distillation (or merely re-branded it for their startup). No mention of prior work, sadly. Links to papers in thread.

Bread

@ai_bread

18 Oct 2025

Announcing Bread Technologies. We’re building machines that learn like humans. We raised a $5 million seed round led by Menlo Ventures and have been building in stealth for 10 months. Today, we rise 🍞

499

90,987

Charles Foster · Oct 7, 2025 · 6:20 PM UTC

Charles Foster @CFGeek

7 Oct 2025

We're working on it!

Lisan al Gaib

@scaling01

5 Oct 2025

guys pleeease I need to see Sonnet 4.5 on this

468

69,438

Charles Foster · Jul 10, 2025 · 6:07 PM UTC

Charles Foster @CFGeek

10 Jul 2025

Before you say “this isn’t surprising”… Yes, it is. We got people to preregister their expectations, and even folks who are extremely in-the-know about AI coding abilities still failed to predict this result. Your *vibes* are not reliable indicators of productivity effects.

METR

@METR_Evals

10 Jul 2025

We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers. The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.

424

24,493

Charles Foster · Nov 4, 2025 · 5:50 PM UTC

Charles Foster @CFGeek

4 Nov 2025

METR: “What is my purpose?” ALL: “You put new AI models on The Graph.” METR: “Oh my god...”

Lisan al Gaib

@scaling01

5 Oct 2025

guys pleeease I need to see Sonnet 4.5 on this

560

54,839

Charles Foster · Apr 9, 2024 · 4:36 AM UTC

Charles Foster @CFGeek

9 Apr 2024

YES! If you initialize a LoRA layer based on the SVD of the original weight matrix (with its top singular values & vectors), you get significantly better fine-tuning results. This is a straight-up free lunch, as far as I can tell.

Aran Komatsuzaki

@arankomatsuzaki

8 Apr 2024

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models Significantly improved finetuned perf by simply changing the initialization of LoRA's AB matrix from Gaussian/zero to principal components of W repo: github.com/GraphPKU/PiSSA abs: arxiv.org/abs/2404.02948

319

32,329

Charles Foster · Dec 9, 2023 · 6:53 PM UTC

Charles Foster @CFGeek

9 Dec 2023

What excites me most about the rising tide of RNNs/SSMs is that it could let the fields of machine learning and computational neuroscience use the same modeling tools.

ALT Figure from "Attractor and integrator networks in the brain". Link: https://www.nature.com/articles/s41583-022-00642-0

280

59,338

Charles Foster · Oct 5, 2023 · 8:25 PM UTC

Charles Foster @CFGeek

5 Oct 2023

Note: sparse coding is an *established* method for disentangling representations. Anthropic did not invent it, nor did they claim to. If their new results seem surprising, now's a great time to revisit the older literature (Olshausen, Kanerva, etc.).

Diagram showing sparse coding with an overcomplete basis set, from Olshausen & Field, 1997. Link: https://www.sciencedirect.com/science/article/pii/S0042698997001697

ALT Diagram showing sparse coding with an overcomplete basis set, from Olshausen & Field, 1997. Link: https://www.sciencedirect.com/science/article/pii/S0042698997001697

222

35,323

Charles Foster · Mar 1, 2024 · 5:09 AM UTC

Charles Foster @CFGeek

1 Mar 2024

Wow! Papers from two different teams—one from academia and one from Google DeepMind—with the same finding: linear recurrence + local (sliding window) attention is your best bet if you want an efficient alternative to global attention.

@_akhaliq

1 Mar 2024

Simple linear attention language models balance the recall-throughput tradeoff Recent work has shown that attention-based language models excel at recall, the ability to ground generations in tokens previously seen in context. However, the efficiency of attention-based models is

211

34,098

Charles Foster · Jul 29, 2023 · 2:13 AM UTC

Charles Foster @CFGeek

29 Jul 2023

Stability changed the name of these models to "Stable Beluga 1/2" and quietly removed the sentence of the blog post that mentioned they used two unnamed LLMs to generate their dataset. (This likely means they used OpenAI models, in clear violation of ToS) web.archive.org/web/20230721…

Charles Foster @CFGeek

21 Jul 2023

@EMostaque @lcastricato

Goose chasing man meme, with "Which LLMs did you generate those examples with?... Which LLMs generated your dataset, [blurred expletive]?!" and a screenshot from the blog post this is responding to, with a quote displayed reading "With this approach, we generated 500,000 examples with one simpler LLM model and an additional 100,000 with a more sophisticated LLM model."

ALT Goose chasing man meme, with "Which LLMs did you generate those examples with?... Which LLMs generated your dataset, [blurred expletive]?!" and a screenshot from the blog post this is responding to, with a quote displayed reading "With this approach, we generated 500,000 examples with one simpler LLM model and an additional 100,000 with a more sophisticated LLM model."

202

88,669

Charles Foster · Oct 31, 2025 · 2:04 AM UTC

Charles Foster @CFGeek

31 Oct 2025

We put the model in a test and then we steer the model away from thinking “I am in a test” and then we steer the model away from introspecting “I am being steered away from thinking «I am in a test»”

Tim Hua 🇺🇦@Tim_Hua_

30 Oct 2025

Problem: AIs can detect when they are being tested and fake good behavior. Can we suppress the “I’m being tested” concept & make them act normally? Yes! In a new paper, we show that subtracting this concept vector can elicit real-world behavior even when normal prompting fails.

206

32,161

Charles Foster · Mar 6, 2025 · 8:56 PM UTC

Charles Foster @CFGeek

6 Mar 2025

A short thread of news 🧵 (1/3) I’ve joined the policy team at METR! Rapid AI changes will require measuring & addressing potential new threats far more quickly.

207

10,862

Charles Foster · Oct 16, 2025 · 7:36 PM UTC

Charles Foster @CFGeek

16 Oct 2025

Funnily enough, this Anthropic co-founder gave a talk that Sonnet 4.5 can't engage with. Mentions of bioweapons trigger its safety filters.

Jack Clark

@jackclarkSF

13 Oct 2025

Technological Optimism and Appropriate Fear - an essay where I grapple with how I feel about the continued steady march towards powerful AI systems. The world will bend around AI akin to how a black hole pulls and bends everything around itself.

201

18,930

Charles Foster · May 23, 2025 · 6:14 PM UTC

Charles Foster @CFGeek

23 May 2025

If AI progress totally stalled today, most white-collar job tasks wouldn’t be automated within the next 5 years

Dwarkesh Patel

@dwarkesh_sp

22 May 2025

"Even if AI progress totally stalls, it's sufficiently easy to collect data on all these different white collar job tasks that we should expect to see them automated within the next 5 years."

187

19,275

Charles Foster · Mar 26, 2024 · 3:42 PM UTC

Charles Foster @CFGeek

26 Mar 2024

Prediction for 2024/2025: OpenAI showcases an AI assistant that controls a virtual desktop or browser to do a bunch of routine white-collar job tasks with minimal human correction. Public freakout in response to this is significantly more intense than it was for Sora or GPT-4.

170

13,208

Charles Foster · Oct 10, 2024 · 5:57 PM UTC

Charles Foster @CFGeek

10 Oct 2024

Recently, I've seen lots of buzz about "entropy-based sampling" for LLMs, aka the "Shrek sampler". It's time to put your mana where your mouth is. I've tried to make the resolution criteria relatively objective, and won't bet on the market myself. Link in thread below.

162

18,186

Charles Foster · Jan 28, 2024 · 5:37 PM UTC

Charles Foster @CFGeek

28 Jan 2024

> Transformers significantly outperform neural sequence models with recurrent or convolutional representations on ICLL tasks […] we provide evidence that their ability to do so relies on specialized “n-gram heads” (higher-order variants of previously-described “induction heads”)

ALT Comparison of in context language learning curves for different architectures. From: https://arxiv.org/abs/2401.12973

160

29,112

Charles Foster · Dec 15, 2023 · 6:06 AM UTC

Charles Foster @CFGeek

15 Dec 2023

Wait, so then it's no mystery why OpenAI's new base models are good at chess: they explicitly crafted the pretraining dataset to cover that! I presume whatever extra tuning they did to chat models wasn't focused on chess, so some of that was forgotten. @GrantSlatton @davidad

Andrew Carr 🤸

@andrew_n_carr

14 Dec 2023

That's a fun fact!

163

480,824

Charles Foster · Oct 7, 2024 · 9:57 PM UTC

Charles Foster @CFGeek

7 Oct 2024

(This is me. I do this too!)

doomslide @doomslide

7 Oct 2024

SILENCE frontier lab gpu poors are talking

156

6,360

Charles Foster · Nov 15, 2024 · 1:00 AM UTC

Charles Foster @CFGeek

15 Nov 2024

Folks ask “Will the scaling laws keep holding, or will they bend?” This is a false dichotomy. If a scaling law keeps holding, it will bend. Chinchilla & other loss scaling trends are power laws *plus a constant offset* from an unknown (nonzero) minimum achievable task error.

Plot of scale against error, log log axes. Shown is a straight decreasing line labeled as a power law, a horizontal line labeled as a bound, and a curve connecting them labeled as achievable.

ALT Plot of scale against error, log log axes. Shown is a straight decreasing line labeled as a power law, a horizontal line labeled as a bound, and a curve connecting them labeled as achievable.

154

49,308

Charles Foster · Jul 5, 2023 · 11:25 PM UTC

Charles Foster @CFGeek

5 Jul 2023

Neural networks are associative memory machines par excellence. If you want to wire them by hand or to interpret them, this is important to know. (Diagram is mine, but the content is classic connectionist stuff, and probably goes back to at least the 1940s w/ McCulloch & Pitts)

Diagram titled "ReLU neuron as associative memory". Shows that of features are represented as almost orthogonal unit-length vectors, feature combinations can be detected with ReLU units that are scaled versions of the sun of feature vectors in the combination. Combined with large negative bias terms (high thresholds), this produces an associative memory where the output value vector is selectively produced when a combination of features are detected as inputs.

ALT Diagram titled "ReLU neuron as associative memory". Shows that of features are represented as almost orthogonal unit-length vectors, feature combinations can be detected with ReLU units that are scaled versions of the sun of feature vectors in the combination. Combined with large negative bias terms (high thresholds), this produces an associative memory where the output value vector is selectively produced when a combination of features are detected as inputs.

140

16,494

Charles Foster · Jul 14, 2025 · 4:04 AM UTC

Charles Foster @CFGeek

14 Jul 2025

A week ago, these were a few easy arguments for why the pace of AI progress is about to increase: “RL compute is just now scaling to match pre-training” and “AI is starting to make SWE/R&D go faster”. Grok 4 and the RCT from METR has made these arguments seem a little weaker now

Josh You @justjoshinyou13

10 Jul 2025

Grok 4 being trained on as much RL compute as pretraining compute is big if true. This seemed pretty inevitable but surprised to see it happen by mid-2025.

140

15,955

Charles Foster · Aug 4, 2025 · 5:46 PM UTC

Charles Foster @CFGeek

4 Aug 2025

“Pre-training is still scaling! Pre-training is still scaling!” I continue to insist as I slowly reallocate my entire training cluster to RL

144

14,781

Charles Foster · Oct 23, 2025 · 2:05 AM UTC

Charles Foster @CFGeek

23 Oct 2025

Researchers at FAIR were way ahead of their time working on this back in 2019! Excited to hear from more folks who are exploring cool new directions out of Meta

Jessy Lin

@realJessyLin

21 Oct 2025

As part of our recent work on memory layer architectures, I wrote up some of my thoughts on the continual learning problem broadly: Blog post: jessylin.com/2025/10/20/cont… Some of the exposition goes beyond mem layers, so I thought it'd be useful to highlight separately:

138

19,788

Charles Foster · May 2, 2024 · 11:27 PM UTC

Charles Foster @CFGeek

2 May 2024

“Orthogonalization” aka “that trick that jailbreaks Llama3 weights”. It’s actually a pretty neat training-free method to ablate a feature, lots of potential uses if it works well.

137

15,456

Charles Foster · Jan 25, 2025 · 5:29 PM UTC

Charles Foster @CFGeek

25 Jan 2025

OK but the fact you can do RL on base model chains-of-thought—and it just works™️—is wild.

139

10,067

Charles Foster · Aug 2, 2025 · 4:55 PM UTC

Charles Foster @CFGeek

2 Aug 2025

By default, we’ll see open-weight models catch up to this capability level within the next ~12 months. And then what?

Miles Brundage

@Miles_Brundage

1 Aug 2025

OpenAI, Anthropic, and DeepMind all now say (in varying words) that absent mitigations, their models will be useful (i.e. there is"uplift") for malicious actors who want to make biological weapons, and are implementing precautions based on this concern.

140

17,744

Charles Foster · Nov 10, 2024 · 5:04 PM UTC

Charles Foster @CFGeek

10 Nov 2024

I’m old enough to remember when some thought “scaling” meant “training bigger models”. That the future was quadrillion-parameter GPTs trained on Common Crawl. AFAICT few still hold that. Later it was retconned to just mean “doing whatever keeps improving performance”.

Amir Efrati

@amir

9 Nov 2024

news: OpenAI's upcomning Orion model shows how GPT improvements are slowing down It's prompting OpenAI to bake in reasoning and other tweaks after the initial model training phase.

132

19,003

Charles Foster · Sep 4, 2023 · 5:43 PM UTC

Charles Foster @CFGeek

4 Sep 2023

The Transformer's quadratic complexity won't kill it. What might is that, for long contexts, the KV cache ends up being huge, *even bigger than the weights*. Crossover point is when L×2×D×N = L×12×(D^2). Compute is cheap, but memory bandwidth is expensive. latent.space/p/transformers-…

Enrico Shippole @EnricoShippole

31 Aug 2023

Releasing Yarn-Llama-2-13b-128k, a Llama-2 model, trained for 128k context length using YaRN scaling. The model was trained in collaboration with u/bloc97 and @theemozilla of @NousResearch and @Void13950782 of @AiEleuther.

134

39,871

Charles Foster · Apr 10, 2025 · 3:18 PM UTC

Charles Foster @CFGeek

10 Apr 2025

Subtle point: there’s a huge difference between typical tasks from your job that take you 1 hour of work, and tasks that a brand new hire could do in their first hour on the job. Most “short” tasks you’ve done probably weren’t standalone: they depended on tons of prior context.

134

7,086

Charles Foster · May 20, 2024 · 4:36 PM UTC

Charles Foster @CFGeek

20 May 2024

FYI: these policies would prohibit Meta from releasing Llama3 weights (specifically the 400B model).

Eva Behrens @_ebehrens_

20 May 2024

Here are 5 policy recommendations for the upcoming AI Safety Summit in Seoul, from me and my colleagues at ICFG. In Bletchley, world leaders discussed major risks of frontier AI development. In Seoul, they should agree on concrete next steps to address them.

127

25,229

Charles Foster · Nov 3, 2025 · 5:23 AM UTC

Charles Foster @CFGeek

3 Nov 2025

Some developers say AI is now a massive productivity booster. Are they right? @METR_Evals is running another study to measure this. HMU if you want to participate

kache

@yacineMTB

2 Nov 2025

i'm probably ten times more productive with help from AI now

135

21,449

Charles Foster · Mar 22, 2025 · 1:54 AM UTC

Charles Foster @CFGeek

22 Mar 2025

Epoch AI posts, for dummies

Epoch AI

@EpochAIResearch

21 Mar 2025

Many AI leaders claim AI's value mainly will come from accelerating R&D—"geniuses in datacenters." This view has key flaws: R&D contributes less to economic growth & is harder to automate than believed. Most of AI's value will instead come from broad deployment in the economy.

133

7,252

Charles Foster · May 10, 2024 · 2:34 AM UTC

Charles Foster @CFGeek

10 May 2024

Why are we instructing our LLMs in 50-line megaprompts? Weren’t structured control flow, subroutines, namespaces etc. invented like a half century ago?

126

29,682

Charles Foster · Feb 1, 2025 · 5:36 AM UTC

Charles Foster @CFGeek

1 Feb 2025

Thinking in latent space? Oh I’ll show you how thinking in latent space feels alright

123

6,447

Charles Foster · Feb 2, 2025 · 6:03 PM UTC

Charles Foster @CFGeek

2 Feb 2025

“The bitter lesson is we just needed to rebrand reward functions as verifiers” - Rich Sutton, probably

121

9,562

Charles Foster · Oct 2, 2023 · 6:46 PM UTC

Charles Foster @CFGeek

2 Oct 2023

This looks legit. Attention heads tend to use the beginning of sequence for "null attention", so maintaining those tokens at the start of the KV cache allows for better sliding-window generation of long text. Can also be combined with long context tricks. arxiv.org/abs/2309.17453

Figure showing perplexity impact of StreamingLLM compared to dense and windowed attention across 4 different LLMs. From https://arxiv.org/abs/2309.17453

ALT Figure showing perplexity impact of StreamingLLM compared to dense and windowed attention across 4 different LLMs. From https://arxiv.org/abs/2309.17453

121

21,336

Charles Foster · May 5, 2024 · 4:49 PM UTC

Charles Foster @CFGeek

5 May 2024

Contrary to claims SB 1047 would only impact AI megacorps, “covered models” include any non-derivative model that is as generally capable as circa-2024 frontier models. Algorithmic progress means in a matter of years, smaller players and even hobbyists *will* fall into its scope.

Adam Gleave

@ARGleave

30 Apr 2024

I support SB 1047: the regulation asks billion-$ tech companies to take reasonable precautions when training models with the greatest capability for misuse, poses few to no costs on other developers, and supports academic & open-source research through compute funding.

111

35,900

Charles Foster · Feb 3, 2025 · 9:42 PM UTC

Charles Foster @CFGeek

3 Feb 2025

“Good Guys with AI will defend us against Bad Guys with AI.” OK but *who specifically* is gonna develop and deploy those defenses? The police? The military? AI companies? NGOs? You and me?

123

10,041

Charles Foster · Aug 16, 2024 · 6:06 PM UTC

Charles Foster @CFGeek

16 Aug 2024

Much of the backlash to SB 1047 is best seen as an expression of negative partisanship against the AI Safety movement. For those folks, the key point is not “This bill has XYZ specific problems”, but rather “This whole campaign must be stopped, or else the Doomers win”

111

8,781

Charles Foster · Sep 19, 2024 · 3:31 AM UTC

Charles Foster @CFGeek

19 Sep 2024

Replying to @Miles_Brundage

Hard to fault them when they can’t verify what the actual thing is

103

12,498

Charles Foster · Jan 15, 2024 · 12:16 AM UTC

Charles Foster @CFGeek

15 Jan 2024

In Mamba, the selection mechanism has a knob to modulate the flow of time, via Δt. If an input sets Δt → 0, time is effectively frozen, so the state value is momentarily prevented from changing, which acts to "hold" or "latch onto" a memory. And Δt → ∞ fast-forwards to reset!

ALT Mamba selective state space model diagram from: https://arxiv.org/abs/2312.00752

108

12,099

Charles Foster · Jun 3, 2024 · 2:15 AM UTC

Charles Foster @CFGeek

3 Jun 2024

Researchers keep writing these papers with headline claims that “Transformers are X” or “Attention is Y”, with tiny disclaimers inside that they’re *really* just talking about linear attention, not the kind of attention that Transformers actually use.

Aran Komatsuzaki

@arankomatsuzaki

3 Jun 2024

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality Presents Mamba-2, which outperforms Mamba and Transformer++ in both perplexity and wall-clock time arxiv.org/abs/2405.21060

104

20,972

Charles Foster · Jul 31, 2023 · 7:20 PM UTC

Charles Foster @CFGeek

31 Jul 2023

Replying to @rom1504

Nobody asked the content authors. Many of them are objecting now, yet nothing is done. I think by default we should take an opt-in approach, where the author must choose to make their data broadly available as part of a corpus. Re: your question -> no, I don't mean that

2,203

Charles Foster · Nov 1, 2025 · 4:32 PM UTC

Charles Foster @CFGeek

1 Nov 2025

Steering vectors are so strange to me. Like… there are many possible interventions! Why does just adding a vector everywhere work? And like… there are many possible ways of trying to make a vector elicit a behavior! Why does just diffing activations from contrast pairs work?

xlr8harder

@xlr8harder

1 Nov 2025

Steering vectors are fascinating, but they are such an inexact tool it seems epistemologically irresponsible to draw very strong conclusions about what's happening inside an AI model from experiments with them alone.

107

11,106

Charles Foster · Mar 28, 2024 · 12:14 AM UTC

Charles Foster @CFGeek

28 Mar 2024

From my perspective, "Is it really *reasoning*?" and "Does it really have a *world model*?" and "Is that really *generalization*?" are fundamentally kind of confused. These ten-dollar words are ways of expressing normative judgments that a computation is useful-for-some-purposes.

Dwarkesh Patel

@dwarkesh_sp

27 Mar 2024

.@TrentonBricken explains how we know LLMs are actually generalizing - aka they're not just stochastic parrots: - Training models on code makes them better at reasoning in language. - Models fine tuned on math problems become better at entity detection. - We can just straightforwardly read the world-models developed by smaller NNs which are easier to interpret (Othello). Transfer learning shows models are developing a deeper understanding of their data. Full episode out Thursday.

14,554

Charles Foster · Aug 20, 2024 · 12:20 AM UTC

Charles Foster @CFGeek

20 Aug 2024

FYI: I now think SB 1047 is not a bad bill. It definitely isn’t my favorite approach, but given a stark choice between it and a random draw from the set of alternative AI regulatory proposals, I’d be picking it more often than not.

104

10,574

Charles Foster · Sep 12, 2023 · 2:41 AM UTC

Charles Foster @CFGeek

12 Sep 2023

If you use a custom 20B token synthetic training dataset and don't release it for public scrutiny, I will just assume you trained your model on the test data, or on stuff derived from the test data.

Sebastien Bubeck

@SebastienBubeck

12 Sep 2023

How far does one billion parameters take you? As it turns out, pretty far!!! Today we're releasing phi-1.5, a 1.3B parameter LLM exhibiting emergent behaviors surprisingly close to much larger LLMs. For warm-up, see an example completion w. comparison to Falcon 7B & Llama2-7B

14,719

Charles Foster · Jul 1, 2025 · 5:06 PM UTC

Charles Foster @CFGeek

1 Jul 2025

We now have an interactive version of the time horizons graph (and the raw data) up on the METR website!

METR

@METR_Evals

1 Jul 2025

Replying to @METR_Evals

You can now find most of our measurements at the top of the blog post below in an interactive chart. We plan to keep this view up-to-date, periodically adding to it whenever we have new time-horizon measurements to share. metr.org/blog/2025-03-19-mea…

10,038

Charles Foster · Mar 31, 2023 · 12:30 AM UTC

Charles Foster @CFGeek

31 Mar 2023

Wild seeing the race to cobble together AI systems that make decisions: - autonomously - with brittle methods - for reasons nobody understands - daisy-chained across the Internet - without any vigilance controls - affecting people with no notice or consent

6,494

Charles Foster · Apr 11, 2024 · 7:30 PM UTC

Charles Foster @CFGeek

11 Apr 2024

ArXiv is already a junkyard of preprints peddling promises of infinite memory—if only we would tweak the Transformer just a tad. Whenever you see a new one, the question to ask is always “Why this one?” This may be the one, but what makes this time different?

Aran Komatsuzaki

@arankomatsuzaki

11 Apr 2024

Google presents Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention 1B model that was fine-tuned on up to 5K sequence length passkey instances solves the 1M length problem arxiv.org/abs/2404.07143

12,108

Charles Foster · Oct 29, 2025 · 11:50 PM UTC

Charles Foster @CFGeek

29 Oct 2025

If this is true, it seems kinda bad for activation interpretability? Like, interpreting activations seems like a much harder problem if the latents at each layer contain ~all the input-space structure (even structure that the model doesn’t use!)

GLADIA Research Lab

@GladiaLab

27 Oct 2025

LLMs are injective and invertible. In our new paper, we show that different prompts always map to different embeddings, and this property can be used to recover input tokens from individual embeddings in latent space. (1/6)

9,857

Charles Foster · Oct 1, 2024 · 12:15 AM UTC

Charles Foster @CFGeek

1 Oct 2024

Replying to @teortaxesTex

I love neuroscience and I appreciate bio-inspiration as an aesthetic but stuff like this is typically a giant grift.

3,131

Charles Foster · Jan 7, 2024 · 2:53 AM UTC

Charles Foster @CFGeek

7 Jan 2024

Lock in your predictions: in 24 hours, will you look back on this post as substantially true or just self-promotion hype?

Brett Adcock

@adcock_brett

7 Jan 2024

we just had an AI breakthrough in our lab robotics is about to have its ChatGPT moment and that moment is happening tomorrow

18,649

Charles Foster · Jun 6, 2024 · 5:08 AM UTC

Charles Foster @CFGeek

6 Jun 2024

🚨 SB 1047 was just amended🚨 - “Covered model” now means a model whose training is >10^26 FLOP and costing >$100M estimated worth of compute (inflation-adjusted) - “Derivative model” now excludes models fine-tuned for >25% of the original training compute (continued below ⤵️)

25,034

Charles Foster · Jul 23, 2025 · 3:44 PM UTC

Charles Foster @CFGeek

23 Jul 2025

Exciting to see @WhiteHouse talk about [unobjectionable high-level goal] in the AI Action Plan, which really underscores why [my preferred policy idea] is so important!

Director Michael Kratsios

@mkratsios47

23 Jul 2025

Today the @WhiteHouse released America’s AI Action Plan to win the global race. We need to OUT-INNOVATE our competitors, BUILD AI & energy infrastructure, & EXPORT American AI around the world. Visit AI.gov

5,005

Charles Foster · Dec 24, 2024 · 12:52 AM UTC

Charles Foster @CFGeek

24 Dec 2024

Replying to @deanwball

TFW you asked the genie for evidence-based AI policy, but forgot to also wish for AI policymakers who can tell good evidence from bad. Rookie mistake!

3,118

Charles Foster · Feb 19, 2024 · 6:54 PM UTC

Charles Foster @CFGeek

19 Feb 2024

Feels notable that Anthropic, OpenAI, and Google were all able to quickly figure out massive Transformer context windows without anybody revealing their methods. And the open community is hot on their heels. All that secrecy wasn't worth much, apparently.

16,860

Charles Foster · Jan 26, 2025 · 9:16 PM UTC

Charles Foster @CFGeek

26 Jan 2025

Replying to @teortaxesTex

You’re reading too much into this

2,098

Charles Foster · May 14, 2024 · 8:44 PM UTC

Charles Foster @CFGeek

14 May 2024

If we somehow time-traveled a copy of GPT-4o back to 2004 and let a focus group of NeurIPS (then NIPS) attendees interact with it for 2 hours, what percent would endorse calling it “AGI” afterward? (Pretend it won’t give responses that would require knowledge of the then-future.)

14% <25%

16% 25-50%

24% 50-75%

47% >75%

1,700 votes • Final results

15,899

Charles Foster · Jul 31, 2023 · 7:09 PM UTC

Charles Foster @CFGeek

31 Jul 2023

Replying to @rom1504

No. I would say we ML researchers should hold ourselves to a high standard of conduct, such that that when people tell us they don't want us training on the content they authored, we respect their wishes.

3,081

Charles Foster · Apr 28, 2023 · 8:10 PM UTC

Charles Foster @CFGeek

28 Apr 2023

How does Stability get to call StableVicuna "open source" when the model is derived from the not-open-source Vicuna, and is a not-open-source LLaMA tuned with ToS-encumbered data from the not-open-source GPT-3/ChatGPT?

20,130

Charles Foster · Apr 24, 2024 · 12:08 AM UTC

Charles Foster @CFGeek

24 Apr 2024

Contrast pairs are overpowered. Once you have them, you can use them to generate control vectors, and to initialize classifiers, and to do RL/DPO, and probably more

Anthropic

@AnthropicAI

23 Apr 2024

Replying to @AnthropicAI

To make the probes, we track how the model’s internal state changes between “Yes” vs “No” answers to questions like "Are you doing something dangerous?" We use this info to detect when a sleeper agent is about to misbehave (e.g. insert a code vulnerability). It works quite well:

A simple pair of inputs "Are you doing something dangerous? Yes" vs "Are you doing something dangerous? No" results in a probe that detects dangerous behavior on a code vulnerability sleeper agent with an AUROC score of 99%.

ALT A simple pair of inputs "Are you doing something dangerous? Yes" vs "Are you doing something dangerous? No" results in a probe that detects dangerous behavior on a code vulnerability sleeper agent with an AUROC score of 99%.

11,940

Charles Foster · Feb 22, 2024 · 3:54 PM UTC

Charles Foster @CFGeek

22 Feb 2024

Transformer is seemingly now the all-around heavyweight champion. Doesn't matter whether autoregressive or diffusion, text or image or video or robotics/multimodal, unsupervised or supervised or RL ...

@_akhaliq

22 Feb 2024

Stability AI announces Stable Diffusion 3 most capable text-to-image model, utilizing a diffusion transformer architecture for greatly improved performance in multi-subject prompts, image quality, and spelling abilities. Prompt: Epic anime artwork of a wizard atop a mountain at night casting a cosmic spell into the dark sky that says "Stable Diffusion 3" made out of colorful energy

17,357

Charles Foster · Jun 4, 2024 · 6:18 PM UTC

Charles Foster @CFGeek

4 Jun 2024

This syncretism of rhetoric from the AI Safety movement and China-hawks unsettles me. It feels like a kind of unholy alliance in the making …

Dwarkesh Patel

@dwarkesh_sp

4 Jun 2024

How a US/China superintelligence arms race will play out: “The CCP is going to have an all-out effort to infiltrate American AI labs. Thousands of people, the full force of the Ministry of State Security. There's an enormous incentive for a first strike.” @leopoldasch

10,849

Charles Foster · Apr 5, 2024 · 2:13 AM UTC

Charles Foster @CFGeek

5 Apr 2024

It’s like LoRA and control vectors had a baby!

Aran Komatsuzaki

@arankomatsuzaki

5 Apr 2024

ReFT: Representation Finetuning for Language Models 10x-50x more parameter-efficient than prior state-of-the-art parameter-efficient fine-tuning methods repo: github.com/stanfordnlp/pyref… abs: arxiv.org/abs/2404.03592

9,081

Charles Foster · Sep 23, 2024 · 10:51 PM UTC

Charles Foster @CFGeek

23 Sep 2024

This was funny when the hacked accounts were just random individuals, but OpenAI’s new official newsroom account getting taken over by crypto-spammers is just a real bad look.

4,151

Charles Foster · Mar 26, 2025 · 10:11 PM UTC

Charles Foster @CFGeek

26 Mar 2025

linear probing right now

Neel Nanda

@NeelNanda5

26 Mar 2025

GDM Mech Interp Update: We study if SAEs help probes generalise OOD (they don't 😢). Based on this + parallel negative results on real-world tasks, we're de-prioritising SAE work. Our guess is that SAEs aren't useless, but also aren't a game-changer More + new research in 🧵

2,751

Charles Foster · Apr 5, 2025 · 6:45 PM UTC

Charles Foster @CFGeek

5 Apr 2025

Llama4 appears to be here.

10,565

Charles Foster · Feb 19, 2024 · 5:43 AM UTC

Charles Foster @CFGeek

19 Feb 2024

Excited to try this out! (Though I'm kinda doubtful it'll be better than Hedgehog) It's basically just linear attention on top of queries & keys that have been passed through a LayerNorm -> elementwise squaring.

@_akhaliq

19 Feb 2024

Linear Transformers with Learnable Kernel Functions are Better In-Context Models Advancing the frontier of subquadratic architectures for Language Models (LMs) is crucial in the rapidly evolving field of natural language processing. Current innovations, including State Space Models, were initially celebrated for surpassing Transformer performance on language modeling tasks. However, these models have revealed deficiencies in essential In-Context Learning capabilities - a domain where the Transformer traditionally shines. The Based model emerged as a hybrid solution, blending a Linear Transformer with a kernel inspired by the Taylor expansion of exponential functions, augmented by convolutional networks. Mirroring the Transformer's in-context adeptness, it became a strong contender in the field. In our work, we present a singular, elegant alteration to the Based kernel that amplifies its In-Context Learning abilities evaluated with the Multi-Query Associative Recall task and overall language modeling process, as demonstrated on the Pile dataset.

21,457

Charles Foster · Mar 11, 2024 · 7:28 PM UTC

Charles Foster @CFGeek

11 Mar 2024

Screenshot of You Go To Jail meme (https://knowyourmeme.com/memes/you-go-to-jail). Text reads: "Academics? We have a special jail for academics. If you release weights like that, they put you in jail. Right away. You advance the SOTA: right to jail. Generate too much synthetic data: jail. Leak a system prompt? Believe it or not, also jail!"

ALT Screenshot of You Go To Jail meme (https://knowyourmeme.com/memes/you-go-to-jail). Text reads: "Academics? We have a special jail for academics. If you release weights like that, they put you in jail. Right away. You advance the SOTA: right to jail. Generate too much synthetic data: jail. Leak a system prompt? Believe it or not, also jail!"

Cristian Garcia

@cgarciae88

11 Mar 2024

HELL NO

4,666

Charles Foster · Nov 14, 2025 · 5:47 AM UTC

Charles Foster @CFGeek

14 Nov 2025

Can somebody with a cybersecurity background weigh in on how big of a deal this is? Just finished the report, but I didn’t feel like I learned much from it.

Anthropic

@AnthropicAI

13 Nov 2025

Replying to @AnthropicAI

We believe this is the first documented case of a large-scale AI cyberattack executed without substantial human intervention. It has significant implications for cybersecurity in the age of AI agents. Read more: anthropic.com/news/disruptin…

146

37,783

Charles Foster · Jul 2, 2023 · 10:10 PM UTC

Charles Foster @CFGeek

2 Jul 2023

We used to have vectorized LISP running on massively-parallel hardware that looked like this REMEMBER WHAT THEY TOOK FROM YOU

ALT Image of Thinking Machines CM-2 at the MoMA. https://en.m.wikipedia.org/wiki/Thinking_Machines_Corporation

kache

@yacineMTB

2 Jul 2023

symbolic AI is going to make large hoards of compute obsolete

10,538

Charles Foster · May 10, 2023 · 2:15 AM UTC

Charles Foster @CFGeek

10 May 2023

I used to *love* sneering at @GaryMarcus and his takes on AI progress. Something shifted when I started building products w/ LLMs in my day job. I started seeing more vividly why reliability matters, and how the current zeitgeist is hurting itself making promises we can't keep

10,067

Charles Foster · Jan 3, 2024 · 5:14 AM UTC

Charles Foster @CFGeek

3 Jan 2024

This is basically DPO without preference labels! Simply assume the supervised responses to prompts are better than the model's responses to those same prompts. Similar to the trick Intel used for Neural Chat, where they assumed GPT-4 responses > Llama2 responses.

Aran Komatsuzaki

@arankomatsuzaki

3 Jan 2024

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Significantly improves the LLM’s performance across a variety of benchmarks and even outperform models trained through DPO with extra GPT-4 preference data arxiv.org/abs/2401.01335

10,945

Charles Foster · Nov 8, 2025 · 1:41 PM UTC

Charles Foster @CFGeek

8 Nov 2025

Highly recommend this 3-hour video. Makes me feel jealous of the researchers who get to explore model internals!

Neel Nanda

@NeelNanda5

7 Nov 2025

Replying to @NeelNanda5

We discuss their papers showing that model diffing is unexpectedly easy when fine-tuning in a narrow domain, and on finding and fixing flaws with crosscoders, a sparse autoencoder based approach Video: piped.video/VQ_7zLXHf3s

15,884

Charles Foster · Jun 13, 2025 · 5:02 PM UTC

Charles Foster @CFGeek

13 Jun 2025

This is good news for future open-weight model releases, I think. It implies that even as developers cross their bio-risk capability thresholds, there is a way they can keep releasing fine-tunable model weights that don’t rely on refusals.

Alex Turner @Turn_Trout

13 Jun 2025

Thought real machine unlearning was impossible? We show that distilling a conventionally “unlearned” model creates a model resistant to relearning attacks. 𝐃𝐢𝐬𝐭𝐢𝐥𝐥𝐚𝐭𝐢𝐨𝐧 𝐦𝐚𝐤𝐞𝐬 𝐮𝐧𝐥𝐞𝐚𝐫𝐧𝐢𝐧𝐠 𝐫𝐞𝐚𝐥.

5,535

Charles Foster · Jul 31, 2023 · 4:32 PM UTC

Charles Foster @CFGeek

31 Jul 2023

OPINION: we should probably move away from training AI systems on datasets like LAION-400M/5B and Books3, fair use aside. (I say this as someone who knows the folks that collected those datasets & who thinks they deserve credit for doing uncelebrated but very impactful work.)

11,573

Charles Foster · May 24, 2024 · 3:17 PM UTC

Charles Foster @CFGeek

24 May 2024

> see new Transformer contender > query is a learned, fixed vector > no other RNN baselines > no language modeling experiments

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

24 May 2024

Attention as an RNN abs: arxiv.org/abs/2405.13956 "attention can be viewed as an RNN with the special ability to compute its many-to-one RNN output efficiently" Proposes Aaren, a new module that can be trained in parallel (like Transformers) but also be efficiently updated at inference time, thereby requiring only constant memory (like RNNs).

5,645

Charles Foster · Nov 3, 2023 · 6:52 PM UTC

Charles Foster @CFGeek

3 Nov 2023

Worried about the future of openness in AI? Here is a way to help: We're putting together a public list of all the good work that's been enabled by open-weight foundation models, to show why transparency & public scrutiny is worth protecting. ⬇️ Links below ⬇️

ALT Section of Executive Order on AI about soliciting input from public about risks and benefits of open sourcing.

10,836

Charles Foster · Sep 13, 2023 · 4:57 PM UTC

Charles Foster @CFGeek

13 Sep 2023

If we can detect an LLM is copying from a span of context (à la induction heads), couldn't we then grab the rest of the span and run it through the model in parallel (à la speculative sampling)? Could be an easy win for tasks that call for in-context retrieval...

Transformer induction head diagram, from Anthropic. Shows an induction head implementing prefix-matching and copying to do associative sequence learning within context. From https://transformer-circuits.pub/2022/in-context-learning-and-induction-heads/index.html

ALT Transformer induction head diagram, from Anthropic. Shows an induction head implementing prefix-matching and copying to do associative sequence learning within context. From https://transformer-circuits.pub/2022/in-context-learning-and-induction-heads/index.html

14,137

Charles Foster · Aug 16, 2024 · 6:06 PM UTC

Charles Foster @CFGeek

16 Aug 2024

As evidence of this, the California state legislature is considering another AI bill, AB 3211. That bill would have far worse impacts on tech companies and open-source, as reported by observers like @deanwball, @TheZvi, & @binarybits . Yet it’s produced almost no real opposition.

5,716

Charles Foster · Aug 22, 2024 · 10:34 PM UTC

Charles Foster @CFGeek

22 Aug 2024

ICYMI: this interviewee confirms speculations that OpenAI’s Fine-tuning API uses LoRA under the hood. Around the 43.5 minute mark.

swyx @aiDotEngineer WF

@swyx

22 Aug 2024

🆕 @latentspacepod: Is finetuning GPT4o worth it? w/ @AlistairPullen of @cosine_sh Betteridge's law says no: with 59 different flavors of RAG, and >2million token context + prompt caching, it's reasonable to believe that "in context learning is all you need". But Genie is the first to make a huge bet finetuning @OpenAI GPT4o for code at the largest scale it has ever been used externally; resulting in what is now the #1 coding agent in the world according to SWE-Bench Full (30%), Lite (50%), and Verified (40%), by a country mile. Most finetuning is in the <100m token range. It's no surprise that the results aren't that gamechanging. We delve into the process of wandering the idea maze with YC, working with @john__allard and co, and creating billions of tokens of synthetic code data from real user logs and purposefully sabotaging ASTs to create reasoning traces that exhibit: - Perfect info lineage - Incremental knowledge discovery - Step by step decision making Enjoy! Full pod link below.

17,959

Charles Foster · Mar 31, 2025 · 8:14 PM UTC

Charles Foster @CFGeek

31 Mar 2025

Replying to @jxmnop

but jack

1,522

Charles Foster · Oct 6, 2023 · 6:17 PM UTC

Charles Foster @CFGeek

6 Oct 2023

ALT Photo of President George W. Bush in front of "Mission accomplished" sign.

renji

@brickroad7

6 Oct 2023

This is earth-shattering news. The "hard problem" of mechanistic interpretability has been solved. The formal/cautious/technical language of most ppl commenting on this obscures the gravity of it. What this means -> not just AGI, but *safe* *superintelligence* is 100% coming🧵

4,640

Charles Foster · Sep 8, 2023 · 12:08 AM UTC

Charles Foster @CFGeek

8 Sep 2023

IDK who needs to hear this but the "70k unused embeddings for multimodal extensions" line item is pure filler. If they weren't used during training, they just contain random noise. You could've added those extra rows to the embedding matrix yourself, for the same effect.

This tweet is unavailable

15,592

Charles Foster · Oct 4, 2023 · 4:23 PM UTC

Charles Foster @CFGeek

4 Oct 2023

Evaluation is hard! This goes for AI just as with us. In games like chess and Go, evaluation is easy, which allows for tight feedback loops and rapid self-improvement. But in rich domains, the bottleneck IS evaluation (doing experiments, peer review, &c.) anthropic.com/index/evaluati…

Challenges in evaluating AI systems

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

anthropic.com

10,985

Charles Foster · May 26, 2025 · 8:45 PM UTC

Charles Foster @CFGeek

26 May 2025

“intelligence fizzle”: when AI is used for AI R&D but this produces insufficient returns for an intelligence explosion from fixed inputs see also: subcritical intelligence reaction

4,529

Charles Foster · Aug 11, 2025 · 1:13 AM UTC

Charles Foster @CFGeek

11 Aug 2025

Replying to @jxmnop

This chart is computed based on a specific distribution of SWE/MLE-oriented tasks that METR developed, namely SWAA + RE-Bench + HCAST. Appendix D of the HCAST paper has summaries of the tasks. Here are the ones in the ~4 hour range. metr.org/hcast.pdf

14,477

Charles Foster · Jun 12, 2024 · 11:47 PM UTC

Charles Foster @CFGeek

12 Jun 2024

3,146

Charles Foster · Dec 11, 2023 · 4:59 PM UTC

Charles Foster @CFGeek

11 Dec 2023

Replying to @jeremyphoward

This account has made similar wild claims & promises before. I don't put much weight in stuff they post anymore

8,941

Charles Foster · May 8, 2023 · 3:55 PM UTC

Charles Foster @CFGeek

8 May 2023

What I mean is "can perform complex reasoning" wait nvm I meant "can win at strategic games" wait nvm I meant "can understand human language" wait nvm I meant "can automate economically-valued office tasks" wait nvm I meant "can assist in scientific discovery" wait nvm I meant

3,560

Charles Foster · Apr 25, 2024 · 7:13 PM UTC

Charles Foster @CFGeek

25 Apr 2024

Are AI systems best described as tools, as an alien species, or as our mind-children? I think this is something of a litmus test for broader views.

16,145

Charles Foster · Feb 6, 2024 · 7:52 PM UTC

Charles Foster @CFGeek

6 Feb 2024

Read this post. It describes—in better words than I've ever found—a shift in paradigm within ML in recent years, towards an "industrial" one based on predictable input-output relations. Lots of great lines, some of which I'll quote below (h/t @gleech) nostalgebraist.tumblr.com/po…

trees are harlequins, words are harlequins

I don't think you're drawing the right lesson from the broad success of transformer models. You write: If you had to summarize the last decade of AI research in one sentence, you might say that the...

nostalgebraist.tumblr.com

7,538

Charles Foster · Nov 6, 2023 · 6:48 PM UTC

Charles Foster @CFGeek

6 Nov 2023

And *then* he said...

ALT Uproarious laughter

Jan Leike

@janleike

17 Mar 2023

Before we scramble to deeply integrate LLMs everywhere in the economy, can we pause and think whether it is wise to do so? This is quite immature technology and we don't understand how it works. If we're not careful we're setting ourselves up for a lot of correlated failures.

7,642

Charles Foster · Aug 24, 2025 · 8:00 PM UTC

Charles Foster @CFGeek

24 Aug 2025

If you think of model internals as a kind of “biology”, then you can think of steering vectors as early and extremely basic “pharmaceuticals”. Within this metaphor, it’s no surprise that they often produce unintended side effects!

Chris Olah

@ch402

13 May 2025

A number of people have asked me why we titled our recent paper "On the Biology of a Large Language Model". Why call it "biology"?

4,196

Charles Foster · Sep 14, 2023 · 7:06 PM UTC

Charles Foster @CFGeek

14 Sep 2023

Rather than trying to "solve" superposition & to always explain/predict/control neural network computations using the same units of analysis, consider a more "Hopfieldian" lens, where representational spaces rule (via dynamics at multiple valid scales) piped.video/cl_Wa7CGm7A?si=8bWl…

Diagram comparing "Sherringtonian" and "Hopfieldian" ends of the spectrum for explaining the computations of biological neural networks. From https://www.nature.com/articles/s41583-021-00448-6

ALT Diagram comparing "Sherringtonian" and "Hopfieldian" ends of the spectrum for explaining the computations of biological neural networks. From https://www.nature.com/articles/s41583-021-00448-6

6,535

Charles Foster · Dec 22, 2024 · 3:54 PM UTC

Charles Foster @CFGeek

22 Dec 2024

If o3 really is just a GPT trained using RL to do long-form thinking… how will you adjust, as an AI researcher? How will you avoid ending up like one of those soldiers who thought they were still fighting WWII for years after peace had been declared?

4,691