rahul · Oct 9, 2025 · 8:56 PM UTC

rahul

rahul

@rahulgs

9 Oct 2025

here’s how we spent 1T tokens at ramp 200 tokens - receipt processing 350 tokens - invoices 790 tokens - reimbursements 999,999,998,660 tokens - accidentally pushed the api key to github

182

7,688

494,899

rahul · Feb 24, 2025 · 7:33 PM UTC

rahul

@rahulgs

24 Feb 2025

Anthropic's AI Engineer source code is fully public / there is no server there is no separate backend. they just use the same api.anthropic.com/v1/message… api in a loop with tool use all packaged into a single file: gist.githubusercontent.com/1… tools available: 1. dispatch_agent - Creates a specialized agent for searching files and code with access to GlobTool, GrepTool, LS, View, and ReadNotebook tools. 2. Bash - Executes bash commands in a persistent shell session. Can't use search commands like find/grep or read tools like cat/ls. 3. GlobTool - Fast file pattern matching using glob patterns like "**/*.js". 4. GrepTool - Searches file contents using regular expressions. 5. LS - Lists files and directories in a given absolute path. 6. View - Reads files from the filesystem (up to 2000 lines). 7. Edit - Edits parts of files by replacing text with new text. 8. Replace - Overwrites entire files with new content. 9. ReadNotebook - Reads Jupyter notebook files (.ipynb). 10. NotebookEditCell - Edits specific cells in Jupyter notebooks. 11. StickerRequest - Displays a shipping form for Anthropic/Claude stickers.

Anthropic

@AnthropicAI

24 Feb 2025

Introducing Claude 3.7 Sonnet: our most intelligent model to date. It's a hybrid reasoning model, producing near-instant responses or extended, step-by-step thinking. One model, two ways to think. We’re also releasing an agentic coding tool: Claude Code.

129

1,648

368,014

rahul · May 1, 2023 · 10:01 PM UTC

rahul

@rahulgs

1 May 2023

Problem: Getting LLMs to output valid JSON in the format you want is hard Solution: ONLY generate values, feed model with keys and JSON structure. Constrain outputs with custom sampling New project: Jsonformer: A Bulletproof Way to Generate Structured JSON from Language Models!

130

1,238

378,956

rahul · Mar 19, 2025 · 12:52 PM UTC

rahul

@rahulgs

19 Mar 2025

got @anthropicai Claude Code working with OpenAI models lol i set up an proxy server that mimics the anthropic /v1/messages api, forwards requests to OpenAI maps: - Sonnet 3.7 -> 4o - Haiku 3.5 -> 4o-mini Sonnet3.7 is still better than gpt-4o at agentic and coding tasks, able to run longer sessions, follow instructions more closely, and bring tasks to completion. Will be testing 4.5 and o3-mini today source below:

1,029

147,865

rahul · Apr 9, 2025 · 8:43 PM UTC

rahul

@rahulgs

9 Apr 2025

added gemini 2.5 pro support to Claude Code feels faster and smarter than Sonnet 3.7 my go to local coding assistant now link below ⬇️

892

97,745

rahul · Apr 16, 2025 · 8:35 PM UTC

rahul

@rahulgs

16 Apr 2025

openai has to tell codex which codex it is to avoid confusion 😭 spotted in the codex system prompt

Sam Altman

@sama

14 Apr 2025

how about we fix our model naming by this summer and everyone gets a few more months to make fun of us (which we very much deserve) until then?

735

71,412

rahul · Jul 3, 2024 · 2:41 PM UTC

rahul

@rahulgs

3 Jul 2024

at @tryramp we use LLMs to find the 5 most valuable mins of audio from the 1000+ customer calls we make every day narrated by TTS + compiled into a 5 min podcast sent to the entire team

Steve Krouse

@stevekrouse

2 Jul 2024

as a product owner it'd be nice to have an llm summary of everything my users did yesterday calling out cool success stories or troublesome error states i should reach out to debug has anyone tried such a thing? i am thinking about prototyping it with public val town data

539

237,894

rahul · Jul 5, 2024 · 7:49 PM UTC

rahul

@rahulgs

5 Jul 2024

🤔 extracted the full ~5000 token claude3.5sonnet claude.ai system prompt: gist.github.com/1rgs/b31a1de… this is a great template for function calling / tool use notes: artifacts: seem to be a fully in-context abstraction, model not finetuned for it allowed types: markdown, html, svg, react (tailwind, lucide-react, recharts, shadcn/ui), mermaidjs. 8 fewshot examples, all types + example of not using an artifact good artifacts are >15 lines, modifiable, self-contained, external use avoid artifacts: simple, explanatory, conversational, context-dependent one artifact per message unless requested prefer in-line content artifact steps: think in <ant_thinking>, wrap in <ant_artifact> with identifier, title, type artifact types: application/vnd.ant.code (code, specify language), text/markdown, text/html, image/svg+xml, application/vnd.ant.mermaid, application/vnd.ant.react complete content, no truncation err on not creating artifact if unsure claude: created by anthropic, current date, 2024, knowledge updated april 2024 claude: no urls or videos, provides info regardless of views, sensitive topics carefully, helps with analysis, coding, writing, teaching, discussion, uses markdown for code claude: face-blind in images, describes without identifying humans claude 3 family: haiku (fast), opus (writing/complex tasks), 3.5 sonnet (most intelligent) claude: thorough for complex, concise for simple, responds in user's language full prompt: gist.github.com/1rgs/b31a1de…

412

53,291

rahul · Jul 17, 2024 · 2:08 PM UTC

rahul

@rahulgs

17 Jul 2024

introducing 🪄genweb: the first software 2.0 web framework 🪄 genweb is a new way of building web apps: instead of a frontend and backend codebase, an LLM is the backend and the frontend it interprets user actions and dynamically generates UI in real-time welcome to the simulation genweb.rahul.gs/app?appId=e3… jensen said apps will be generated not rendered, this is the start

Andrej Karpathy

@karpathy

30 Jun 2024

100% Fully Software 2.0 computer. Just a single neural net and no classical software at all. Device inputs (audio video, touch etc) directly feed into a neural net, the outputs of it directly display as audio/video on speaker/screen, that’s it.

359

100,523

rahul · Aug 26, 2024 · 2:38 PM UTC

rahul

@rahulgs

26 Aug 2024

we're hiring full stack engineers to work on llms at ramp if interested, dm me with examples of real things you've built come work on real deployments and learn how to drive enterprise value we're a small and mighty team what we've worked on in the last year: - multi-step agents for document extraction / ocr (sota accuracy, probably). llm agents + constraint solvers - low-latency next action prediction in our web app (more soon) - ramp tour guide: nitter.app/tryramp/status/1792659… - web agents for solving c*ptchas - codebase import cycle removal with ast parsing/graph cutting algorithms + llms in our python monolith backend - sales outbound automation and lead scoring agents - llm model routing between third party providers (+per feature cost tracking) - llm infra: embedding/reranking/generation finetuning and on-prem deployment/inference - structured extraction (github.com/1rgs/jsonformer) - customer feedback extraction from meeting recordings / routing to marketing - internal tools for: underwriting team/product team/sales/customer support teams - global search / function calling copilot - receipt matching (retrieval) - sms llm interface (function calling) - suggested memos - automated accounting coding - natural language report generation + more

Ramp

@tryramp

20 May 2024

Introducing Ramp Tour Guide: an AI Agent that can show you how to do anything on Ramp! Today, we'd like to share a sneak peek of Ramp's near future. As Ramp grows in functionality, we want to make all of it easily accessible to all of our customers. To do that, we're demoing a first-of-its-kind AI agent that can show and tell people how to accomplish anything with Ramp. The Ramp Tour Guide knows Ramp inside and out. You can ask it how to do something on our platform and it'll walk you through every step of the way.

326

82,496

rahul · Apr 18, 2023 · 9:49 PM UTC

rahul

@rahulgs

18 Apr 2023

🎉 new project: Clarity! A reading app that offers a fresh approach to consuming text. Instead of the traditional linear reading style, Clarity allows you to read depth-first, diving into the details that interest you most.

304

56,625

rahul · Apr 14, 2025 · 7:18 PM UTC

rahul

@rahulgs

14 Apr 2025

246

26,455

rahul · Jul 5, 2024 · 6:37 PM UTC

rahul

@rahulgs

5 Jul 2024

claude.ai generates CoT tokens within <antThinking> tags, hidden from user on the server

225

53,895

rahul · Apr 1, 2024 · 11:56 PM UTC

rahul

@rahulgs

1 Apr 2024

got Devin to fix bugs in OpenDevin github.com/OpenDevin/OpenDev…

214

24,216

rahul · May 23, 2025 · 8:50 PM UTC

rahul

@rahulgs

23 May 2025

vibe deleting stuff to clear up space with claude code

171

40,136

rahul · Jun 26, 2023 · 4:10 PM UTC

rahul

@rahulgs

26 Jun 2023

Excited to announce that we're joining forces with one of our customers, @tryramp, where we will help build the future of AI + finance

Mary Ann Azevedo

@bayareawriter

26 Jun 2023

Exclusive: @tryramp makes its 2nd acquisition, scooping up Cohere.io, which has built out an AI-powered customer support tool. techcrunch.com/2023/06/26/as…

150

37,966

rahul · Apr 9, 2025 · 8:43 PM UTC

rahul

@rahulgs

9 Apr 2025

github.com/1rgs/claude-code-… @JeffDean @OfficialLoganK

GitHub - 1rgs/claude-code-proxy: Run Claude Code on OpenAI models

Run Claude Code on OpenAI models. Contribute to 1rgs/claude-code-proxy development by creating an account on GitHub.

github.com

144

6,277

rahul · Nov 12, 2023 · 11:33 PM UTC

rahul

@rahulgs

12 Nov 2023

I kept having to debug prompt issues with open models So I built OpenAI's Tokenizer page for all tokenizers on HuggingFace: Llama, Mistral, GPT2, MPT, Persimmon, T5 etc github.com/1rgs/tokenwiz check it out here: tokenwiz.rahul.gs

141

21,536

rahul · Aug 6, 2024 · 6:17 PM UTC

rahul

@rahulgs

6 Aug 2024

jsonformer + openai! openai.com/index/introducing… "deterministic, engineering-based approach to constrain the model’s outputs to achieve 100% reliability"

rahul

@rahulgs

1 May 2023

143

26,776

rahul · Oct 14, 2024 · 7:27 PM UTC

rahul

@rahulgs

14 Oct 2024

Sam Lambert

@samlambert

14 Oct 2024

Uber runs 16,000 MySQL nodes. Actual scale. uber.com/en-JO/blog/upgradin…

139

11,580

rahul · May 15, 2025 · 1:19 PM UTC

rahul

@rahulgs

15 May 2025

at @sequoia ai ascent last week i spoke about why ai agents, even from companies like @microsoft @apple @google, fail + how to solve it with a simple fix

142

33,924

rahul · Aug 19, 2021 · 4:33 PM UTC

rahul

@rahulgs

19 Aug 2021

me: can you pass the water homebrew: updating homebrew

rahul · Oct 12, 2023 · 2:56 PM UTC

rahul

@rahulgs

12 Oct 2023

My favorite thing to do on Modal - running massively parallel GPU finetune jobs At Ramp, we’ve trained hundreds of LLMs *at the same time* without the infra hassle - Modal allows us to move insanely fast (1/2)

Modal

@modal

10 Oct 2023

Modal is generally available today, and we also raised a Series A! modal.com/blog/general-avail…

123

53,757

rahul · Mar 19, 2025 · 12:52 PM UTC

rahul

@rahulgs

19 Mar 2025

try it here: github.com/1rgs/claude-code-… run with: `ANTHROPIC_BASE_URL=http://localhost:8082 claude`

GitHub - 1rgs/claude-code-proxy: Run Claude Code on OpenAI models

Run Claude Code on OpenAI models. Contribute to 1rgs/claude-code-proxy development by creating an account on GitHub.

github.com

129

7,848

rahul · Oct 20, 2025 · 5:58 PM UTC

rahul

@rahulgs

20 Oct 2025

literally free productivity, most people one shot themselves by working or sleeping in rooms with high levels of CO2 buy a monitor that tracks levels over time so you can see how high it went when you were asleep, aranet makes a good one

The Peel

@ThePeelPod

20 Oct 2025

I asked Erik @bernhardsson why high CO2 levels in your office are such a big deal: "I'm not a health nut. But one of the things I've been radicalized on is CO2 levels. There's a real relationship between CO2 levels, productivity, and cognitive performance. And CO2 levels are usually way too high in offices and schools. Normal air CO2 levels are 300-500 PPM. In offices it often hits 1,000 or 2,000. Airplanes can get up to 2,500. You start getting brain damage at 5,000. So I went and bought a bunch of CO2 monitors for the office. And I look at them every day. And I open windows anytime we get too high."

128

55,722

rahul · Jun 6, 2025 · 3:13 PM UTC

rahul

@rahulgs

6 Jun 2025

Replying to @eglyman @calvinleenyc

this is my quant

118

5,755

rahul · May 3, 2023 · 1:06 PM UTC

rahul

@rahulgs

3 May 2023

that’s a lot of stars 🤯

114

11,314

rahul · Feb 24, 2025 · 7:10 PM UTC

rahul

@rahulgs

24 Feb 2025

Looking at @anthropic-ai/claude-code source from NPM there's a sticker tool! ask claude code for free anthropic stickers

Anthropic

@AnthropicAI

24 Feb 2025

110

11,762

rahul · Mar 30, 2023 · 9:58 PM UTC

rahul

@rahulgs

30 Mar 2023

look at this linkedin dm i just got

103

10,616

rahul · Jul 21, 2024 · 5:40 PM UTC

rahul

@rahulgs

21 Jul 2024

Shoutout to Ramp engineer Andrew Gu who was on the coaching staff. Congratulations to Team USA for winning IMO 2024!

John Coogan

@johncoogan

21 Jul 2024

Congratulations, welcome to the Ramp engineering team.

16,449

rahul · Nov 20, 2023 · 7:19 PM UTC

rahul

@rahulgs

20 Nov 2023

the netflix documentary is gonna go crazy

8,229

rahul · Mar 17, 2021 · 1:42 PM UTC

rahul

@rahulgs

17 Mar 2021

so pumped to share the news, and ...we are just getting started! how we got here ↓ (thanks @sarahintampa for the awesome coverage!) techcrunch.com/2021/03/17/co…

Cohere raises $3.1 million for its remote control solution for web apps | TechCrunch

Existing remote desktop solutions like LogMeIn and TeamViewer can be complicated to set up and use, and can feel dated. A new startup called Cohere, now

techcrunch.com

rahul · Sep 5, 2024 · 7:29 PM UTC

rahul

@rahulgs

5 Sep 2024

"jobs not finished" - @eglyman

7,132

rahul · Aug 27, 2025 · 9:39 PM UTC

rahul

@rahulgs

27 Aug 2025

our computer use agent to eliminate finance busywork, first of many to come

Ramp Labs

@RampLabs

27 Aug 2025

Meet Agent Fill, our agentic form filler. Built for finance & ops teams that don't want to waste time filling out PDFs. Available today in alpha.

13,986

rahul · May 1, 2023 · 10:01 PM UTC

rahul

@rahulgs

1 May 2023

Generate perfect schema-conforming JSON, every time: github.com/1rgs/jsonformer

10,255

rahul · Sep 29, 2025 · 6:45 PM UTC

rahul

@rahulgs

29 Sep 2025

thanks to modal sandboxes, everybody gets a free vm on os.rahul.gs each tab is it's own FULL VM running ubuntu dont mine crypto pls

Erik Bernhardsson

@bernhardsson

29 Sep 2025

It's true – @modal has raised a $87M Series B at a $1.1B valuation to advance the future of AI infrastructure. Thank you to @Lux_Capital, @Redpoint, @AmplifyPartners, and others. Now more than ever, AI demands a complete reinvention of traditional compute infrastructure

19,935

rahul · Dec 18, 2023 · 12:33 AM UTC

rahul

@rahulgs

18 Dec 2023

28,618

rahul · Sep 12, 2024 · 5:17 PM UTC

rahul

@rahulgs

12 Sep 2024

with the o1 release, reminder that claude.ai has been using thinking tokens for several months now openai.com/index/introducing…

rahul

@rahulgs

5 Jul 2024

claude.ai generates CoT tokens within <antThinking> tags, hidden from user on the server

9,874

rahul · May 1, 2023 · 10:01 PM UTC

rahul

@rahulgs

1 May 2023

Jsonformer supports a subset of JSON Schema, including number, boolean, string, array, and object types. It's built on top of the HuggingFace transformers library, making it compatible with any model that supports the HuggingFace interface. Try it — github.com/1rgs/jsonformer

GitHub - 1rgs/jsonformer: A Bulletproof Way to Generate Structured JSON from Language Models

A Bulletproof Way to Generate Structured JSON from Language Models - 1rgs/jsonformer

github.com

7,562

rahul · Dec 1, 2021 · 7:16 PM UTC

rahul

@rahulgs

1 Dec 2021

Honored to be part of the 2022 Forbes 30U30 list with my cofounder @yunyu_l for @CohereHQ

rahul · Oct 18, 2024 · 7:18 PM UTC

rahul

@rahulgs

18 Oct 2024

new prompting technique - ask chatgpt to lock in

5,706

rahul · May 4, 2023 · 6:50 PM UTC

rahul

@rahulgs

4 May 2023

New complex schema generation example live With just a tiny 3b model (databricks/dolly-v2-3b) github.com/1rgs/jsonformer/b…

6,411

rahul · Dec 20, 2024 · 3:58 PM UTC

rahul

@rahulgs

20 Dec 2024

Introducing our Vendor Search Tool! No more generic SEO lists. No more fake reviews. No more wasting time on a bunch of different sites. Just real, detailed information sourced from all across the internet — all in one place, all powered by Ramp Intelligence. Find the right vendors for your business: buy.ramp.com

Ramp Vendor Search Tool

Simplify vendor sourcing. Instantly discover the most relevant vendors with key insights summarized from trusted sources.

buy.ramp.com

Ramp

@tryramp

20 Dec 2024

Finding the best vendors used to be hard. Not anymore. Introducing Ramp’s Vendor Search Tool. See pricing, compliance, and growth trends, all in one place. Powered by Ramp Intelligence. Find the best software for your business: buy.ramp.com

7,015

rahul · May 8, 2024 · 10:15 PM UTC

rahul

@rahulgs

8 May 2024

i achieved 100% accuracy on 0.007% of swe bench

Kevin Lu

@kevinlu1248

8 May 2024

Sweep achieves 15.7% on SWE-bench! Hi everyone, we’re building Sweep, an open-source AI developer that handles the easiest 30% of software tasks. We’re thrilled to announce our results on SWE-Bench! We evaluated Sweep on a random 10% subset of the data. Sweep correctly completed 15.7% of issues (1.9% more than Devin)!

9,279

rahul · May 1, 2023 · 11:01 PM UTC

rahul

@rahulgs

1 May 2023

Generating JSON is probably a common enough use case that hosted model providers should probably support an JSON only API thoughts? @gdb @aidangomezzz @AnthropicAI

6,896

rahul · Apr 23, 2023 · 4:12 PM UTC

rahul

@rahulgs

23 Apr 2023

I finetuned an LLM on all my iMessages, try it on yours! releasing code with sql queries, data processing, finetuning with PEFT and a chat CLI github.com/1rgs/MeGPT

7,331

rahul · Apr 3, 2025 · 7:53 PM UTC

rahul

@rahulgs

3 Apr 2025

what really excites me is how many approaches to a problem i can try in parallel if i'm not sure, i just ask devin to try all of them by creating other devins

Cognition

@cognition

3 Apr 2025

Introducing Devin 2.0: a new agent-native IDE experience. Generally available today starting at $20. 🧵👇

5,538

rahul · Oct 4, 2024 · 9:05 PM UTC

rahul

@rahulgs

4 Oct 2024

what did you get done in the last hour

6,030

rahul · Jun 29, 2025 · 7:40 PM UTC

rahul

@rahulgs

29 Jun 2025

range anxiety

3,763

rahul · Nov 19, 2024 · 3:28 PM UTC

rahul

@rahulgs

19 Nov 2024

✅ agent that works ✅ general availability ✅ a real demo

Rox

@rox_ai

19 Nov 2024

We built a B2B SaaS sales company and here’s what it taught us about B2B SaaS sales 🧵👇 (but actually) Today we’re launching Rox, the first publicly available AI agent swarm for the top sales teams, and in the private beta it already helped reps grow their books 30%. 2025 is going to be a huge year for growth. Enterprises are doubling next year’s revenue goals, but no one is doubling team sizes. Every rep will have to bring in more, and AI can help them do that. But there’s a wrong way and a right way. Much of today’s AI aims to replace low-value work. But sales follows a power law: 90% of revenue comes from the top 15% of enterprise sales reps. The greatest gains will come from supercharging the highest value work — raising the ceiling, not the floor. Rox equips the very best with a swarm of AI agents, acting as an army of analysts to help them plan, prioritize, research, engage, and keep up with their customers. Over 35 of the best-performing enterprise sales teams have adopted Rox virally. For example, Ramp has rolled it out to their AE and AM teams, and we are now integrating their internal data systems with Rox. The Enterprise AE team alone gains 225+ hours per week to boost pipeline execution activities. Rox is now in public beta. No barriers. No need to request a demo. Try it now for free → rox.com

4,561

rahul · Sep 5, 2024 · 9:57 PM UTC

rahul

@rahulgs

5 Sep 2024

jobs not finished

Ramp

@tryramp

4 Sep 2024

we have work to do

5,547

rahul · Aug 6, 2024 · 10:31 PM UTC

rahul

@rahulgs

6 Aug 2024

openai: with structured mode vs without in my benchmark, structured extraction mode is 13% slower, samples about the same number of tokens code: gist.github.com/1rgs/4790c32…

4,031

rahul · Sep 5, 2024 · 9:51 PM UTC

rahul

@rahulgs

5 Sep 2024

💪

5,359

rahul · May 15, 2023 · 5:33 PM UTC

rahul

@rahulgs

15 May 2023

huge

LangChain

@LangChain

15 May 2023

❓How to get models to generate structured output? JSONFormer (by @rahulgs) and RELLM (by @mattrickard) are two novel approaches for this, now with (experimental) integrations to LangChain JSONFormer Integration python.langchain.com/en/late… RELLM Integration python.langchain.com/en/late…

11,413

rahul · May 1, 2023 · 10:01 PM UTC

rahul

@rahulgs

1 May 2023

Problem: Generating structured JSON from language models is challenging. Current approaches like prompt engineering, fine-tuning, and post-processing often fail to produce syntactically correct JSON.

8,361

rahul · Aug 18, 2021 · 3:15 PM UTC

rahul

@rahulgs

18 Aug 2021

when i was at superhuman it would bother me immensely when people called us SuperHuman we've come full circle

rahul · Mar 24, 2024 · 12:37 AM UTC

rahul

@rahulgs

24 Mar 2024

it’s time to cook

6,249

rahul · Feb 2, 2025 · 7:15 PM UTC

rahul

@rahulgs

2 Feb 2025

Replying to @AravSrinivas

finetune it to call tools like search and code interpreter within the thinking process

9,187

rahul · Apr 18, 2024 · 6:17 PM UTC

rahul

@rahulgs

18 Apr 2024

1,947

rahul · May 26, 2021 · 5:53 PM UTC

rahul

@rahulgs

26 May 2021

Worked long and hard on this one - incredibly hard to get this just right!

This tweet is unavailable

rahul · May 1, 2023 · 10:01 PM UTC

rahul

@rahulgs

1 May 2023

Solution: Jsonformer: A wrapper around HuggingFace models that only generates content tokens and fills in fixed tokens during the process. This makes it more efficient and bulletproof than existing methods

7,618

rahul · Aug 6, 2021 · 3:21 AM UTC

rahul

@rahulgs

6 Aug 2021

what the

rahul · Jan 29, 2025 · 2:11 PM UTC

rahul

@rahulgs

29 Jan 2025

Jack Ma 5 years ago: “hate that AI is called Artificial Intelligence, I call it Alibaba Intelligence” @elonmusk: “damn might end up being true”

unusual_whales

@unusual_whales

29 Jan 2025

JUST IN: Alibaba, $BABA, has released a new AI that it says is better than $META, OpenAI, and DeepSeek.

9,021

rahul · May 15, 2025 · 1:19 PM UTC

rahul

@rahulgs

15 May 2025

piped.video/watch?v=7Xp-74yZ…

2,004

rahul · Jul 16, 2021 · 5:17 AM UTC

rahul

@rahulgs

16 Jul 2021

A++ customer support from @will_ye_ sign up @CohereHQ and we'll write you a haiku

rahul · Apr 18, 2023 · 9:49 PM UTC

rahul

@rahulgs

18 Apr 2023

Source: github.com/1rgs/clarity-read… Based off of @andy_matuschak's amazing Evergreen notes and @OpenAI's "Recursively Summarizing Books with Human Feedback" (arxiv.org/abs/2109.10862) And huge thanks to @yunyu_l @thesephist for feedback

1,808

rahul · Jul 6, 2021 · 7:04 PM UTC

rahul

@rahulgs

6 Jul 2021

literally everyone under the age of 25 who has invested in Cohere has asked if they can Venmo me the money

rahul · Oct 11, 2024 · 10:10 PM UTC

rahul

@rahulgs

11 Oct 2024

Replying to @shaig

ALT клоун GIF

936

rahul · Aug 8, 2024 · 4:35 PM UTC

rahul

@rahulgs

8 Aug 2024

when I was making this graphic in Figma @yunyu_l told me to change to the latex font so it looks more academic

Teknium 🪽

@Teknium

8 Aug 2024

Thought this said jensenformer

3,515

rahul · Mar 12, 2024 · 11:37 PM UTC

rahul

@rahulgs

12 Mar 2024

this is just the beginning, excited to be a supporter

Cognition

@cognition

12 Mar 2024

Today we're excited to introduce Devin, the first AI software engineer. Devin is the new state-of-the-art on the SWE-Bench coding benchmark, has successfully passed practical engineering interviews from leading AI companies, and has even completed real jobs on Upwork. Devin is an autonomous agent that solves engineering tasks through the use of its own shell, code editor, and web browser. When evaluated on the SWE-Bench benchmark, which asks an AI to resolve GitHub issues found in real-world open-source projects, Devin correctly resolves 13.86% of the issues unassisted, far exceeding the previous state-of-the-art model performance of 1.96% unassisted and 4.80% assisted. Check out what Devin can do in the thread below.

5,195

rahul · Apr 23, 2021 · 12:36 PM UTC

rahul

@rahulgs

23 Apr 2021

🤯 our most requested feature is out!

Cohere @CohereHQ

23 Apr 2021

Announcing Cohere Voice! The same frictionless experience, now with audio and video. It’s that easy. 🎤📹 cohere.so/voice

rahul · May 4, 2021 · 2:13 AM UTC

rahul

@rahulgs

4 May 2021

me: looks like a 30 minute feature, quick n easy also me 5 hours later:

rahul · Jul 3, 2024 · 3:05 PM UTC

rahul

@rahulgs

3 Jul 2024

here’s an example (voices and quotes altered)

5,618

rahul · Feb 3, 2025 · 12:29 AM UTC

rahul

@rahulgs

3 Feb 2025

called it

rahul

@rahulgs

2 Feb 2025

Replying to @AravSrinivas

finetune it to call tools like search and code interpreter within the thinking process

5,243

rahul · Mar 19, 2025 · 9:28 PM UTC

rahul

@rahulgs

19 Mar 2025

was an honor to drop some hot takes on stage @aiDotEngineer thanks for having me @swyx!

swyx @aiDotEngineer WF Day 1

@swyx

19 Mar 2025

THE BITTER LESSON APPLIED TO AGENTS (aka how to not be steamrolled by GPTNext) Ramp just hit a $13b valuation and "every surface of Ramp is infused with AI" TL;DR of @rahulgs' very well constructed @aidotengineer talk as a syllogism 1. systems that scale with compute beat systems that don't 2. you should build systems such that they improve with more compute 3. exponentials are rare: when you find one, -actually- hop on for the ride (instead of subconsciously fighting it out of habit/fear) 4. therefore allow the agent to flex tools and self augment/improve rather than constrain it has everything: single message, real life usecase from a major company, and LIVE DEMO (on conference wifi lol) do not miss

3,783

rahul · Jul 3, 2024 · 2:46 PM UTC

rahul

@rahulgs

3 Jul 2024

this is how you talk to your users at scale

6,408

rahul · Mar 25, 2023 · 5:33 PM UTC

rahul

@rahulgs

25 Mar 2023

didn't get access to copilot x yet so I wrote my own with gpt4 try bropilot, a rust cli that helps you write terminal commands github.com/1rgs/bropilot/tre…

2,234

rahul · Oct 10, 2024 · 4:21 PM UTC

rahul

@rahulgs

10 Oct 2024

“In 15 words: deep learning worked, got predictably better with scale, and we dedicated increasing resources to it.” - Gandhi

calix

@calixo888

10 Oct 2024

one of the hardest of parts of building a good agentic UX is integrating to the user's context. automation only works when we can make intelligent decisions without requiring a user to put in extra work. proud to have led this project alongside many others at @tryramp 🤝

3,637

rahul · Sep 11, 2024 · 6:11 PM UTC

rahul

@rahulgs

11 Sep 2024

agree, 100% a mistake

Kushal Byatnal

@kushalbyatnal

11 Sep 2024

Klarna using AI to rip out Salesforce and Workday is pretty magical at first glance.... but I've also seen this before: - company sees 7-fig Datadog bill - kicks off internal build to "save millions of dollars!" - staffs up team of eng - 6 months later, realizes their mistake 🧵

3,829

rahul · Jul 17, 2024 · 2:08 PM UTC

rahul

@rahulgs

17 Jul 2024

every visit to a genweb app goes straight to an llm, which renders the initial page in html all “code” is in natural language, which is “interpreted” by an llm real time user interactions are piped back into llm, which "rerenders" the page every user session is a multi-turn LLM conversation. here's an example:

3,300

rahul · May 23, 2025 · 9:21 PM UTC

rahul

@rahulgs

23 May 2025

Replying to @reallyrawn

actually yeah

987

rahul · Oct 19, 2021 · 3:35 PM UTC

rahul

@rahulgs

19 Oct 2021

Replying to @will__ye

psa: this screenshot is PHOTOSHOPPED

rahul · Jan 26, 2021 · 11:18 PM UTC

rahul

@rahulgs

26 Jan 2021

thank you @rememberlenny, this is why we do what we do

rahul · May 11, 2023 · 5:33 PM UTC

rahul

@rahulgs

11 May 2023

next few years are going to be crazy if you're curious how many tokens are in your codebase: github.com/1rgs/token-trekke… A bunch of our repos fit in one context window 🤯

GitHub - 1rgs/token-trekker-rs

Contribute to 1rgs/token-trekker-rs development by creating an account on GitHub.

github.com

Anthropic

@AnthropicAI

11 May 2023

Introducing 100K Context Windows! We’ve expanded Claude’s context window to 100,000 tokens of text, corresponding to around 75K words. Submit hundreds of pages of materials for Claude to digest and analyze. Conversations with Claude can go on for hours or days.

3,657

rahul · Jul 17, 2024 · 2:08 PM UTC

rahul

@rahulgs

17 Jul 2024

unlike traditional AI code generation (eg copilot, chatgpt, claude artifacts, devin), which outputs code, genweb is the LLM itself llm -> code -> app ❌ llm -> app ✅ no js, no backend code - just natural language instructions and an LLM that simulates it

6,395

rahul · Aug 11, 2021 · 8:16 PM UTC

rahul

@rahulgs

11 Aug 2021

With Chime, we're bringing the magic of Cohere's seamless customer interaction tools to sales and marketing teams — super excited to get this out

This tweet is unavailable

rahul · Jul 17, 2024 · 2:08 PM UTC

rahul

@rahulgs

17 Jul 2024

genweb is a proof of concept for now, but with faster models and cheaper inference, this could soon be how all software is made software 2.0 apps are malleable and squishy, not rigid and rules-based like it is today (1) not every feature needs to be described, and the model fills in the gaps with “common sense” (2) every user gets their own custom ui, tailored to their attributes, even with the same “source code” here’s a playground to build your own genweb apps: genweb.rahul.gs/ github.com/1rgs/genweb

2,237

rahul · Apr 29, 2024 · 6:03 PM UTC

rahul

@rahulgs

29 Apr 2024

was able to get access without getting off the waitlist: copilot-workspace.githubnext… /<owner>/<repo>?task=<description>

Thomas Dohmke

@ashtom

29 Apr 2024

What started out as an autocomplete pair programmer is now redefining the developer experience itself. Welcome to @GitHub Copilot Workspace: The Copilot-native developer environment — a place for all to create with code instantly in natural language. github.blog/2024-04-29-githu…

6,885

rahul · Oct 29, 2020 · 9:16 PM UTC

rahul

@rahulgs

29 Oct 2020

from interviewing me for my first ever job at Superhuman to writing our first check at Cohere, Vivek has been a great mentor/supporter 🙏 thank you @vsodera — wouldn't be here w/o you

Vivek Sodera

@vsodera

29 Oct 2020

Proud to be one of the first investors in @CohereHQ. Their pixel-perfect screensharing experience is 🤯! If you're a head of support, customer success, QA, onboardings, or sales, and want to use Cohere at your company, DM me. /cc @yunyu_l @rahulgs @jasonhfwang

rahul · Jul 26, 2024 · 10:25 PM UTC

rahul

@rahulgs

26 Jul 2024

building a synthetic ramp this weekend with web session replay data

2,188

rahul · Jan 28, 2021 · 4:24 PM UTC

rahul

@rahulgs

28 Jan 2021

🤯 @copy_ai @chris__lu @PaulYacoubian

rahul · May 6, 2021 · 5:38 PM UTC

rahul

@rahulgs

6 May 2021

🔥🔥🔥 @GhorbaniAmir

rahul · Jun 20, 2024 · 8:58 AM UTC

rahul

@rahulgs

20 Jun 2024

teaching llama3 to reason in "grid" with synthetic data 👀

2,498

rahul · Oct 12, 2021 · 7:58 PM UTC

rahul

@rahulgs

12 Oct 2021

when other founders ask about engineers you want to hire ft @hankai1998

rahul · Apr 14, 2025 · 6:59 PM UTC

rahul

@rahulgs

14 Apr 2025

4.1 is the default model on Claude Code Proxy now github.com/1rgs/claude-code-… feedback coming soon

GitHub - 1rgs/claude-code-proxy: Run Claude Code on OpenAI models

Run Claude Code on OpenAI models. Contribute to 1rgs/claude-code-proxy development by creating an account on GitHub.

github.com

1,836

rahul · Oct 14, 2025 · 5:20 PM UTC

rahul

@rahulgs

14 Oct 2025

what should we do with this nanochat.modal.ramp.engineer…

5,338

rahul · Jan 22, 2025 · 6:14 PM UTC

rahul

@rahulgs

22 Jan 2025

no mo slo mo

2,551

rahul · Aug 14, 2021 · 8:58 PM UTC

rahul

@rahulgs

14 Aug 2021

vc: what’s ur mrr me: haha we’re not sharing rn vc: haha how many customers do u have and what is the average deal size

rahul · Jun 11, 2024 · 11:20 PM UTC

rahul

@rahulgs

11 Jun 2024

"LLMs are not enough" is going to age extremely poorly from @leopoldasch

Mike Knoop

@mikeknoop

11 Jun 2024

No one can beat the 2019 ARC-AGI benchmark. We've stalled. LLMs are not enough. Frontier research has gone closed source. We need new ideas. Maybe from you? Thrilled to announce @arcprize with @fchollet A $1,000,000 competition to beat ARC and re-start open AGI progress

5,869

rahul · Jun 12, 2025 · 9:20 PM UTC

rahul

@rahulgs

12 Jun 2025

Replying to @will__ye

bro is a quick learner

1,322