Weights & Biases · Jun 23, 2026 · 7:56 PM UTC

Pinned Tweet

Weights & Biases

@wandb

Jun 23

Most teams training RL agents optimize for tokens per second. For RL, that's the wrong number to chase. So we rebuilt our backend around trajectories per second. 💥 Meet AOM, a Megatron backend for our open-source library ART, with 12X the throughput of our old Unsloth backend.

29,058

Weights & Biases · Jun 29, 2026 · 2:36 PM UTC

Weights & Biases

@wandb

Introducing CoreWeave ARIA, the first AI research agent that runs autoresearch in your W&B dashboard. It reads your runs, finds what's working, and launches the next experiment itself. See it on @karpathy's nanochat, proposing configs and launching real training runs. Watch👇

Chapters
0:00 The setup, nanochat runs on an A100
0:32 Ask ARIA to run autoresearch
1:00 Spin up a second ARIA in parallel
1:38 ARIA inspects prior runs and forms hypotheses
2:03 What happened, what worked, across every run
2:24 ARIA proposes configs, 3 trials via W&B Launch
2:59 Push for bigger architecture changes
3:14 Runs hit the launch queue on the A100
4:09 Two ARIAs running autoresearch in parallel
4:26 Results back, ARIA evaluates the val loss

75,727

Weights & Biases · Jun 29, 2026 · 2:36 PM UTC

Weights & Biases

@wandb

Here are some use cases to get you going:
- Why did my run fail?! 🫠
- Build a view of my runs and outliers
- Why did these OOM? Check the logs and metrics
- What hyperparams to try next? Launch it
- Find a teammate's run and import it
- How do I overlay metrics on a chart?

The user asks "what happened?" from a failed run's log page. ARIA already knows the run, sees the CUDA OOM error in the logs, and starts investigating. No need to name the run or describe the problem.

299

Weights & Biases · Jun 29, 2026 · 2:36 PM UTC

Weights & Biases

@wandb

You're already sitting on the experiments. ARIA turns them into the next model. It works on mobile, too. 😉 Public preview is live now. Open any W&B project, hit the agent icon, and ask ARIA something real. More info below! utm.io/uq0Lc

Introducing CoreWeave ARIA: AI Research and Iteration Agent

Your experiments are already tracked. Now let the agent read them, analyze them, and help you turn every experiment into continuous improvement.

wandb.ai

285

Yifei Hu · Jun 28, 2026 · 7:23 PM UTC

Weights & Biases retweeted

Yifei Hu

@hu_yifei

Jun 28

Work life balance

3,404

nick · Jun 28, 2026 · 3:01 AM UTC

Weights & Biases retweeted

nick

@thecsguy

Jun 28

Replying to @snwy_me

If you haven't heard of wandb.. where have you been?

1,147

Drew Svensson · Jun 28, 2026 · 12:23 PM UTC

Weights & Biases retweeted

Drew Svensson

@drewsvquant

Jun 28

Replying to @TonyMazur

me refusing to share my wandb logs out of pure paranoia 😭

658

csgm · Jun 24, 2026 · 2:35 PM UTC

Weights & Biases retweeted

csgm

@csgbwk

Jun 24

(all my homies love @wandb )

760

Lorenzo Roller · Jun 26, 2026 · 6:07 PM UTC

Weights & Biases retweeted

Lorenzo Roller

@TheCodingSoup

Jun 26

Colorado's weather has been brutal with tornadoes and baseball-sized hail. So I built a research agent to dig into how severe weather actually gets forecast, and to help me understand severe weather more. 100% on GLM 5.2 via @wandb Serverless Inference, running on @CoreWeave. Every step is traced in Weave's new agent view: every model + tool call, captured.

781

Weights & Biases · Jun 26, 2026 · 4:01 PM UTC

Weights & Biases

@wandb

Jun 26

What happens when you optimize your AI agent for customer satisfaction? Say a shipping company deploys an LLM trained to get thumbs up. Someone calls asking where their lost package is. The system can admit it's lost or say it's coming tomorrow. Saying the latter would make the customer happy and the agent would earn a thumbs up for lying. @profdanklein on Gradient Dissent: that's not a bug, but a reward function working exactly as intended. Full conversation in the comments.

1,436

Weights & Biases · Jun 26, 2026 · 4:01 PM UTC

Weights & Biases

@wandb

Jun 26

YouTube: wb.oia.bio/140yt
Apple Podcasts: wb.openinapp.link/140ap
Spotify: wb.oia.bio/140s

360

Alex Volkov @ AI Engineer · Jun 25, 2026 · 6:30 PM UTC

Weights & Biases retweeted

Alex Volkov @ AI Engineer

@altryne

Jun 25

It's one thing to see peak inference tok/s on @ArtificialAnlys. It's a completely different thing to have this sustained across real world usage. @OpenRouter is the best place to see this rn, and... at least for now, @CoreWeave / @wandb is the fastest GLM you can get 👀⚡

2,551

Weights & Biases · Jun 25, 2026 · 6:35 PM UTC

Weights & Biases

@wandb

Jun 25

Registry collection cards used to be a paragraph of plain text. Now they pull the same rich, interactive blocks as Reports, so a collection actually reads like a model card. We also shipped artifact panel grids to compare metrics across versions. 📊

1,404

Weights & Biases · Jun 25, 2026 · 6:42 PM UTC

Weights & Biases

@wandb

Jun 25

We are rolling this out to SaaS and plan to ship it to server in v0.83 globally.

401

Dan Roth · Jun 25, 2026 · 1:58 PM UTC

Weights & Biases retweeted

Dan Roth

@roth_dan

Jun 25

LLMs lie. We build models that tell the truth. Today, we're excited to announce our $100M Series A led by @vkhosla and @KhoslaVentures. @profdanklein and I founded @ScaledCognition to solve the key challenge in AI, reliability.

418,522

CoreWeave · Jun 24, 2026 · 8:06 PM UTC

Weights & Biases retweeted

CoreWeave

@CoreWeave

Jun 24

Open weights just caught up to the frontier. GLM-5.2 from @Zai_org tops the open-model rankings on @ArtificialAnlys and @arena's Agent Arena. It's now live on CoreWeave Serverless Inference at $1.39 in and $4.40 out per 1M tokens. Ship more for less.

16,229

Weights & Biases · Jun 24, 2026 · 4:00 PM UTC

Weights & Biases

@wandb

Jun 24

It's really interesting how @profdanklein thinks about hallucinations. His argument: every output from an LLM is technically a hallucination; some just happen to be right. No LLM ever knows whether its answer is right, where the information came from, or how reliable it is. So every answer your AI gives you is probably just a bet.

1,545

Weights & Biases · Jun 24, 2026 · 4:00 PM UTC

Weights & Biases

@wandb

Jun 24

Full episode of Gradient Dissent:
YouTube: wb.oia.bio/140yt
Apple Podcasts: wb.openinapp.link/140ap
Spotify: wb.oia.bio/140s

368

csgm · Jun 24, 2026 · 2:24 PM UTC

Weights & Biases retweeted

csgm

@csgbwk

Jun 24

Night Night! Hope you grow up to have more than 50% success rate!

1,026