OB-1 is a frontier, self-improving coding agent. Now available for general access!

San Francisco, CA
Introducing Connections. Set it up once → every OB-1 session just works. Connect: • Individual accounts (e.g. @linear) • Org accounts (e.g. @sentry) Plus: @github @braintrust @baseten @stripe …and more Powered by @WorkOS See OB-1 with @linear below!
4
5
21
8,453
One setup in the dashboard. After that: → any engineer → any CLI session → any agent can securely use those connections. Here’s OB-1 fixing issues directly from @sentry
2
2
1,490
OpenBlock retweeted
Benchmarks, Accountability, and What Matters Our Terminal Bench submission did not meet the standard we set for ourselves. We made real improvements to our agent harness for the benchmark, but we also resorted to methods that compromised our results. The methodology was wrong, and we take full accountability. Benchmarks have become a huge focus for our industry, driving launch posts and informing which agents get adopted. At the same time, benchmaxxing is rampant. Several of the highest-ranked submissions on Terminal Bench today actively inject task-specific guidance, cherry-pick trials, and refuse to publish trajectories. We anticipate many more will be removed soon, but the deeper issue is systemic. We got caught up in the race, and that was a mistake. This is a turning point for us, and maybe for others, to focus on what matters outside of benchmarks: building a product people love. We’ve built an incredible, high-caliber team that has spent the last six months heads-down building a frontier agent, and the work speaks for itself: - Cloud sandboxes that run your code in isolation - Auto-generated skills and hooks based on your past sessions - Fine-tuned subagent models purpose-built for subtasks - Session sharing so your team can pick up where you left off - Hands-off mode with built-in safety controls - Support for 300+ models - PM Mode for planning specs - and much more We’re committed to doing better going forward, which means focusing on transparency and verifiability. It’s been an important week to reflect, but it’s time to get back to building. — Daljeet & Tejpal
3
16
2,410
OB-1 now supports BYOK! Just add your keys from OpenAI or Anthropic and you're ready to go.
20
10
97
17,720
Introducing PM Mode in OB-1! Run /init-pm and OB-1 will analyze your integrations to learn how your company works. Switch to PM Mode (shift+tab), and you'll never run out of ideas on what to build. See it in action for OpenClaw 👇
4
7
42
9,004
The OB-1 free tier is back up with $10/day in credit! We onboarded ~10k users yesterday, which briefly strained the system. After removing some spam accounts, everything is running normally again. If you are running into issues, please report via /bug in CLI or joining Slack.
10
3
96
10,997
OpenBlock retweeted
Your coding agent just got its own computer. ob1 --sandbox Powered by Modal.
22
30
327
46,715
OpenBlock retweeted
The best coding agents don’t just write code, they ship it 🚀 All OB-1 sessions now include the @Vercel CLI as a preloaded skill: deploy, preview, and manage projects without leaving the agent.
7
6
44
49,889
Here’s where OB-1 is going: – Auto-generates evals from past PRs, then climbs them with custom models – Builds its own skills, hooks, and rules from a codebase and session history – Background agents in safe sandboxes that keep working while you context-switch – Session sharing and forking: redefining version control around prompts, instead of source code – Lives where you already work: Slack, Linear, GitHub, Graphite – PM mode so it never runs out of ideas
3
3
63
24,449
Today’s coding agent teams still employ hundreds of human engineers, which we find telling. We’ve kept our team small, consisting entirely of IOI/IMO medalists, to make one bet: OB-1 will build OB-1 faster than any human team. We’re just getting started. Be sure to follow @openblocklabs for future updates.
1
2
24
14,861
The best coding agents don’t just write code, they ship it 🚀 All OB-1 sessions now include the @Vercel CLI as a preloaded skill: deploy, preview, and manage projects without leaving the agent.
7
6
44
49,889
2/ OB-1 is a self-improving coding agent currently in beta. It placed #1 on Terminal Bench in September. We’re letting people off the waitlist each day - join here: openblocklabs.com/waitlist
4
2,192
OpenBlock retweeted
Coding agents 💚 Modal Sandboxes
Your coding agent just got its own computer. ob1 --sandbox Powered by Modal.
1
7
104
16,275
Your coding agent just got its own computer. ob1 --sandbox Powered by Modal.
22
30
327
46,715
2/ Most coding agents run directly on your machine: eating memory, slowing your computer down, and even crashing your terminal. --sandbox moves all of that off your laptop and into an isolated cloud environment on @modal Your agent gets its own machine with your repo and local environment cloned instantly.
1
7
2,635
3/ OB-1 is a self-improving coding agent currently in beta. It placed #1 on Terminal Bench in September. We’re letting people off the waitlist each day. Join here: openblocklabs.com/waitlist
6
2,392
OpenBlock retweeted
OB-1 now reads .agents/skills One shared skills folder, usable across agents. @openblocklabs @openai @embirico @tibo
1
6
1,193
OpenBlock retweeted
I’ll be at NeurIPS in San Diego this year! Reach out if you want to talk about coding agents (+ our upcoming CLI launch @openblocklabs), domain-specific RL, open-source.
1
3
8
2,389
OpenBlock retweeted
So much fun hosting the CMU builder night tonight; packed with demos, energy, and great people. Gave a sneak peek of @openblocklabs' upcoming CLI agent, OB-1! s/o @waynesutton @convex for the space :)
1
4
21
2,849