What you need to know about AI research trends, from @natolambert. Weekly on Wednesday mornings, with occasional extra posts.

A recipe for frontier model post-training Apple, Meta, and Nvidia all agree — synthetic data, iterative training, human preference labels, and lots of filtering. interconnects.ai/p/frontier-…
1
25
170
83,512
OpenAI's o1 using "search" was a PSYOP How to understand OpenAI's o1 models as really just one wacky, wonderful, long chain of thought. interconnects.ai/p/openais-o…
5
22
165
48,961
Synthetic data: Anthropic’s CAI, from fine-tuning to pretraining, OpenAI’s Superalignment, tips, types, and open examples Synthetic data is the accelerator of the next phase of AI — what it is and what it means. interconnects.ai/p/llm-synth…
2
23
121
93,054
Reverse engineering OpenAI’s o1 What productionizing test-time compute shows us about the future of AI. Exploration has landed in language model training. interconnects.ai/p/reverse-e…
19
116
39,371
China's Top 19 Open Model Labs We ranked all the organizations in China releasing open models, from the top of DeepSeek to small, newer academic labs making waves with tech reports and niche models. interconnects.ai/p/chinas-to…
6
28
118
201,650
An unexpected RL Renaissance New talk! Forecasting the Alpaca moment for reasoning models and why the new style of RL training is a far bigger deal than the emergence of RLHF. YouTube: piped.video/watch?v=YXTYbr3h… Slides: docs.google.com/presentation… More info: interconnects.ai/p/an-unexpe…
2
14
86
45,685
OpenAI’s Strawberry, LM self-talk, inference scaling laws, and spending more on inference Whether or not scaling works, we should spend more on inference. interconnects.ai/p/openai-st…
10
84
72,700
This also means you can write off your Interconnects AI subscription. Not official tax advice.
wow - $5k tax free for ai retooling
21
35
4,697
We're all excited about the GPT-5 release. Here's a fun game for while you watch. Potential prizes coming later! Livestream links coming soon.
9
2
74
31,961
Futures of the data foundry business model Scale AI’s future versus further scaling of language model performance. How Nvidia may take all the margins from the data market, too. interconnects.ai/p/ai-data-f…
1
5
66
107,922
Model merging lessons in The Waifu Research Department When what seems like pure LLM black magic is actually supported by the literature. interconnects.ai/p/model-mer…
2
10
64
32,381
In the last ~6 months more closely analyzing the open models and datasets of note across the community on @huggingface, we've highlighted artifacts from 141 different organizations. It takes many people to build the open ecosystem for AI. ACE-Step AI-MO AIDC-AI ASLP-lab Alpha-VLLM AtlaAI BAAI (4) BLIP3o ByteDance (3) ByteDance-Seed (4) CYFRAGOVPL CohereLabs (5) DataoceanAI DatologyAI (2) Datou1111 Etched EuroBERT Freepik (2) GSAI-ML Goedel-LM Hcompany HelloKKMe HiDream-ai HuggingFaceTB (3) ICTNLP JetBrains LGAI-EXAONE LLM360 (2) MiniMaxAI (2) NX-AI NexaAIDev Nexusflow NousResearch NovaSky-AI (2) Open-Reasoner-Zero OpenGVLab OpenPipe OuteAI POLARIS-Project PRIME-RL PeterJinGo PlayHT PleIAs (2) PrimeIntellect (2) Qwen (15) RekaAI Salesforce (2) Skywork (6) Snowflake SparkAudio StarJiaxing SultanR THUDM (3) TIGER-Lab UCSC-VLAA UW-Madison-Lee-Lab Wan-AI (3) WisdomShell XiaomiMiMo (2) Xkev Zyphra (2) agentica-org (2) ai21labs all-hands allenai (5) allura-org amd (3) answerdotai apple arcee-ai (6) arcinstitute bespokelabs canopylabs cl-nagoya convergence-ai deepcogito deepseek-ai (9) ds4sd echo840 facebook (3) fdtn-ai featherless-ai fishaudio genmo google (9) haizelabs hexgrad hkust-nlp hpcai-tech ibm-granite (9) inclusionAI (4) infgrad internlm (5) kuleshov-group kyutai (4) laion (2) lerobot lightonai m-a-p marin-community maya-multimodal meta-llama (2) metagene-ai microsoft (7) mistralai (6) mixedbread-ai mobiuslabsgmbh moonshotai (4) nanonets nllg nomic-ai (3) nvidia (18) open-r1 open-thoughts openbmb (3) opencompass osmosis-ai ostris perplexity-ai qihoo360 rednote-hilab reducto rhymes-ai (3) ruliad sand-ai sarvamai sesame si-community simplescaling stabilityai (2) stepfun-ai (4) tencent (3) thomas-sounack tiiuae (2) tngtech tomg-group-umd vidore (2) vikhyatk xlr8harder yentinglin zed-industries
1
10
22
5,884
DBRX: The new best open model and Databricks’ ML strategy Databricks’ new model is surpassing the performance of Mixtral and Llama 2 70B while still being in a size category that's reasonably accessible. interconnects.ai/p/databrick…
2
11
59
21,599
Ilya on deep learning in 2015 On vision and how to understand deep learning. interconnects.ai/p/ilya-on-d…
6
31
5,515
RLHF roundup: Getting good at PPO, sketching RLHF’s impact, RewardBench retrospective, and a reward model competition Things to be aware of if you work on language model fine-tuning. interconnects.ai/p/rlhf-roun…
12
52
12,660
GPT 5 Launch Party w/ Will Brown & Swyx nitter.app/i/broadcasts/1OyKALjqR…
6
10
42
10,107
As Meta tries to race to build a great new research lab, we wanted to remind everyone that the organization structure is just as much of a challenge as the personnel. Here are our takeaways from earlier in the year. Rec's first.
1
11
28
3,580
Interviewing Eugene Vinitsky (@EugeneVinitsky) on self-play for self-driving and what else people do with RL #13. Reinforcement learning fundamentals and scaling. interconnects.ai/p/interview…
1
7
49
15,864
RL backlog: OpenAI's many RLs, clarifying distillation, and latent reasoning Notes I forgot to publish. Closing some loose ends in the reasoning model discussions. interconnects.ai/p/rl-backlo…
1
9
51
25,557
We're working on best-available analyses of where open models come from, who uses them, and how much. What questions do you have?
There are like 10-20 Chinese orgs shipping open models that I try and keep a somewhat close eye on and there are like 3-4 in the rest of the world 😳
4
8
34
7,468
Kimi K2 and when "DeepSeek Moments" become normal One "DeepSeek Moment" wasn't enough for us to wake up, hopefully we don't need a third. interconnects.ai/p/kimi-k2-a…
3
7
41
25,316
OpenAI's o3: The grand finale of AI in 2024 A step change as influential as the release of GPT-4. Reasoning language models are the current and next big thing. interconnects.ai/p/openais-o…
1
1
38
22,245
OLMoE and the hidden simplicity in training better foundation models Ai2 released OLMoE, which is probably our “best” model yet relative to its peers, but not much has changed in the process. interconnects.ai/p/olmoe-and…
2
7
1,597
State-space LLMs: Do we need Attention? Mamba, StripedHyena, Based, research overload, and the exciting future of many LLM architectures all at once. interconnects.ai/p/llms-beyo…
5
23
2,056
Latest open artifacts (#12): Chinese models continue to dominate throughout the summer 🦦 A new flagship Qwen model, Qwen3-235B-A22B-Instruct-2507, and a general rise in ecosystem quality in Artifacts Log 12. interconnects.ai/p/latest-op…
1
1
35
11,678
DeepSeek V3 and the actual cost of training frontier AI models The $5M figure for the last training run should not be your basis for how much frontier AI models cost. interconnects.ai/p/deepseek-…
8
35
16,910
RLHF progress: Scaling DPO to 70B, DPO vs PPO update, Tülu 2, Zephyr-β, meaningful evaluation, data contamination Huge steps forward in confirming that RLHF can really help you on vibes-based evaluation, among many other RLHF analyses. interconnects.ai/p/rlhf-prog…
1
2
33
18,479
The White House's plan for open models & AI research in the U.S. Thoughts on the new AI Action plan, American DeepSeek, and what comes next. interconnects.ai/p/the-white…
1
2
33
13,830
Artifacts Log 3: Synthetic math and Magpie datasets, another 1T param model, and many Mistral models Artifacts ~124 and on for the year. (partial $) interconnects.ai/p/artifacts…
3
5
1,926
Big Tech's LLM evals are just marketing A PSA everyone needs. The importance of a wait and see attitude when it comes to new models, big and small, open and closed. interconnects.ai/p/evals-are…
1
3
5
1,487
RLHF lit. review #1 and missing pieces in RLHF: Looking at the difference between two sets -- what rumors say industry leaders are doing with RLHF and what the literature is up to. A new series studying RLHF literature. interconnects.ai/p/rlhf-lit-…
7
29
10,862
Interviewing OLMo 2 leads: Open secrets of training language models What we have learned and are going to do next. YouTube: piped.video/dS7QI99uJVc Notes + Podcast: interconnects.ai/p/olmo-2-po…
8
26
61,660
What I'm reading (#2): More on Kimi K2, how to build a bad research center, Pretraining with RL, and sporks of AGI A quiet summer is all you need. interconnects.ai/p/what-im-r…
6
27
11,742
GPT-4.5: "Not a frontier model"? OpenAI's latest model raises more questions than answers, but no, the AI bubble isn't popping quite yet. interconnects.ai/p/gpt-45-no…
1
25
20,409
Sycophancy and the art of the model GPT-4o-simp, LMArena backlash, and people refusing to understand how messy and crucial RLHF is. interconnects.ai/p/sycophanc…
1
2
23
12,286
Interviewing Tri Dao and Michael Poli of Together AI on the future of LLM architectures The first Interconnects research interview! We go even further on the promise of state-space models in the emerging LLM market. interconnects.ai/p/interview…
3
23
7,888
Managing frontier model training organizations (or teams) How do the frontier labs consistently train great models? How can they fail? interconnects.ai/p/how-to-ma…
3
22
28,313
If you’re a student and want to read paid posts, contact @natolambert by email or DM. Happy to provide a base 80%+ discount.
3
3
26
23,446
The latest open artifacts (#8): The return of ~30B models, side effects of OpenAI's proposed DeepSeek ban, and yet another reasoning roundup Artifacts Log 8. Expect this pace to continue until mid summer. interconnects.ai/p/the-lates…
1
2
21
4,323
Latest open artifacts (Artifacts Log #10): New DeepSeek R1 0528!, more permissive licenses, everything as a reasoner, and from artifacts to agents interconnects.ai/p/latest-op…
2
19
15,103
Deep Research, information vs. insight, and the nature of science What AI will accelerate in the scientific process, what it cannot do, and how we can prepare for new manners of scientific investigation. interconnects.ai/p/deep-rese…
1
2
18
9,428
Undoing RLHF and the brittleness of safe LLMs Recent papers show most of the arguments about needing "safety" in releases of open LLM weights are nearly dead in the water. Yes, still release the parameters. Read here: interconnects.ai/p/undoing-r…
7
19
11,973
Interviewing Tim Dettmers (@Tim_Dettmers) on open-source AI: Agents, scaling, quantization and what's next Interconnects interview #10. Catching up with one of the leaders of open-source AI. interconnects.ai/p/tim-dettm…
1
1
17
16,542
Latest open artifacts (#13): The abundance era of open models Mostly thanks to Qwen, but now we're spoiled for choice and winds are shifting. interconnects.ai/p/latest-op…
1
3
18
13,258
ChatBotArena: The peoples’ LLM evaluation, the future of evaluation, the incentives of evaluation, and gpt2chatbot What the details tell us about the most in-vogue LLM evaluation tool — and the rest of the field. interconnects.ai/p/chatbotar…
2
17
3,837
OpenAI’s Model (behavior) Spec, RLHF transparency, personalization questions Now we will have some grounding for when weird ChatGPT behaviors are intended or side-effects — shrinking the Overton window of RLHF bugs. interconnects.ai/p/openai-rl…
6
2
9
5,412
Multimodal LM roundup: Unified IO 2, inputs and outputs, Gemini, LLaVA-RLHF, and RLHF questions A sampling of recent happenings in the multimodal space. Be sure to expect more this year. interconnects.ai/p/multimoda…
3
16
7,122
Elicitation, the simplest way to understand post-training An F1 analogy to help understand fast improvements in post-training on top of slow improvements in scaling. buff.ly/XmEa3XN
1
3
17
13,666
People use AI more than you think And businesses too. The most important trend in AI that gets washed away from between the headlines. interconnects.ai/p/people-us…
4
4
16
7,805
What people get wrong about the leading Chinese open models: Adoption and censorship Narrative violations on licenses, adoption, and censorship. interconnects.ai/p/what-peop…
1
3
14
21,445
Phi 3 and Arctic: Outlier LMs are hints Models that seem totally out of scope from recent open LLMs give us a sneak peek of where the industry will be in 6 to 18 months. interconnects.ai/p/phi-3-and…
3
14
9,411
Model commoditization and product moats Where moats are tested now that so many people have trained GPT4 class models. Claude 3, Gemini 1.5, Inflection 2.5, and Mistral Large are here to party. interconnects.ai/p/gpt4-comm…
1
15
14,213
The latest open artifacts (#9): RLHF book draft, where the open reasoning race is going, and unsung heroes of open LM work Artifacts Log 9. interconnects.ai/p/the-lates…
2
15
29,535
Evaluations: Trust, performance, and price (bonus, announcing RewardBench) Evaluation is not only getting harder with modern LLMs getting more complicated, it’s getting harder because it means something different. interconnects.ai/p/evaluatio…
3
15
5,805
The DPO debate: Do we need RL for RLHF? Direct vs. RL methods for preferences, more RLHF models, and hard truths in open RLHF work. We have more questions than answers. interconnects.ai/p/the-dpo-d…
2
3
12
7,151
We aren’t running out of training data, we are running out of open training data Data licensing deals, scaling, human inputs, and repeating trends in open vs. closed LLMs. interconnects.ai/p/the-data-…
2
14
6,899
AI for the rest of us Apple Intelligence makes a lot of sense when you get out of the AI bubble. Plus, the cool technical details Apple shared about their language models "thinking different." interconnects.ai/p/apple-int…
2
14
13,976
OpenAI's GPT-4.1 and separating the API from ChatGPT OpenAI's latest models optimizing on intelligence per dollar. We'll continue to see ChatGPT handled differently than the API business. interconnects.ai/p/openais-g…
3
10
3,679
The latest open artifacts (#6): Reasoning models, China's lead in open-source, and a growing multimodal space Artifacts Log 6. The open LM ecosystem yet again accelerates. interconnects.ai/p/open-arti…
2
13
7,585
How scaling changes model behavior Some trends are reasonable to extrapolate, some are not. Even for the trends we are succeeding at extrapolating, it is not clear how that signal translates into different AI behaviors. interconnects.ai/p/how-scali…
12
15,901
Making the U.S. the home for open-source AI Open-source AI is here to stay, but it is not a given that it will be American. interconnects.ai/p/making-th…
1
2
12
11,251
Mixtral Round-up: MoE trade-offs, release lessons, Mistral raises $400mil, Google's loss, vibes vs marketing Emergency blog 🚨 We have an amazing open mixture of experts model for the holidays! interconnects.ai/p/mixtral
1
3
11
10,511
We don’t need to reinvent everything to solve alignment Integrating some non-computing science into reinforcement learning from human feedback (RLHF) can give us the models we want. Bonus: OLMo 1.7-7B. interconnects.ai/p/reinventi…
1
2
11
5,363
It's 2024 and they just want to learn The state of the ML communities big and small starting 2024. My general expectations for the year. interconnects.ai/p/they-want…
3
12
9,668
Latest open artifacts (#11): Visualizing China's open models market share, Arcee's models, and VLAs for robotics Artifacts Log 11. interconnects.ai/p/latest-op…
3
12
11,423
Interviewing Finbarr Timbers on the "We are So Back" Era of Reinforcement Learning Interconnects interview #11. An overview on the past, present, and future of RL. interconnects.ai/p/finbarr-t…
1
2
11
5,733
This month's leading open model contributors in Artifacts Log. Thanks for continuing to release your work. Qwen (@Alibaba_Qwen) x5 Zhipu AI (@ZhipuAI) x2 NVIDIA (@nvidia) x2 OpenAI (@OpenAI) InclusionAI (@InclusionAI666) x2 Infinigence (@infinigenceAI) Tesslate (@TesslateAI) Arcee (@arcee_ai) DeepCogito (@DeepCogito) Kakao (@kakaocorpglobal) Skywork (@Skywork_ai) Tencent (@TencentGlobal) x2 InternLM (@intern_lm) StepFun (@StepFun_ai) SK Telecom (@SKtelecom) Xiaomi MiMo (@Xiaomi) OpenBMB (@OpenBMB) Quotient AI (@QuotientAI) Roblox (@Roblox) Knowledgator (@knowledgator) Cisco Foundation AI (@fdtn_ai) ByteDance Seed (@bytedance_talk) JHU CLSP (@jhuclsp) VAGO Solutions (@VAGOsolutions) NuMind (@numind_ai) Neta (@NetaArt_AI) Black Forest Labs (@bfl_ml) Numina (@ProjectNumina) Krea AI (@krea_ai) Hugging Face M4 (@huggingface) moondream (@moondreamai) SpatialVerse (@spatialverse) RedNote HiLab x2 KwaiPilot PowerInfer MetaStoneTec Trillion Labs IBM Granite ScienceOne AI kpsss34 X-Omni Tencent BAC MiSpeech Wan AI The Common Pile
Latest open artifacts (#13): The abundance era of open models Mostly thanks to Qwen, but now we're spoiled for choice and winds are shifting. interconnects.ai/p/latest-op…
2
11
1,673
The AI research job market shit show (and my experience) There are plenty of jobs, but finding a place where you're happy is as hard as ever. Read here: interconnects.ai/p/ai-resear…
2
9
1,576
SB 1047, AI regulation, and unlikely allies for open models The rallying of the open-source community against CA SB 1047 can represent a turning point for AI regulation. interconnects.ai/p/sb-1047-a…
2
1
8
2,960