Cartesia · Jun 15, 2026 · 5:51 PM UTC

Cartesia

Pinned Tweet

Cartesia

@cartesia

Jun 15

Two new models just dropped 👀 Sonic-3.5 and Ink-2 are the #1 streaming models for text to speech and speech to text

Karan Goel

@krandiash

Jun 15

We released Sonic-3.5 and Ink-2, the #1 streaming models for text to speech and speech to text you can use in your voice agents today. New architectures enable new frontiers for speed and quality. We're now the only provider to have #1 models for both speaking and listening.

102

14,876

Cartesia · Jun 17, 2026 · 8:03 PM UTC

Cartesia

@cartesia

Jun 17

Sonic-3.5 is also the #1 streaming TTS model on @voicearena_ai for notable languages including English🇺🇸, Hindi 🇮🇳, Portuguese 🇧🇷

Voice Arena

@voicearena_ai

Jun 17

Cartesia Sonic 3.5 is the #1 streaming TTS model on Voice Arena US English Leaderboard. In the overall leaderboard (streaming + non-streaming) it jumped from rank #9 → #2, moving ahead of Grok TTS, ElevenLabs v3 & OpenAI's gpt-4o-mini-tts. Sonic-3.5 is the latest TTS model from @cartesia . It supports 42 languages, with 500+ voices available out of the box. The model has been highly preferred among raters on @voicearena_ai . Results backed by 11,110 blind, head-to-head listener votes

4,483

bolna · May 29, 2026 · 12:05 PM UTC

Cartesia retweeted

bolna

@bolna_dev

May 29

Announcing the Bolna × Cartesia VOC-A-THON! Calling the most cracked builders in Voice AI. Come ship voice agents powered by Sonic 3.5, Cartesia's most natural and expressive TTS model yet. - Build with Sonic 3.5, Cartesia's newest TTS model - @bolna_dev and @cartesia teams in the room - Free Sonic 3.5 & Bolna credits for every participant - Exciting prizes for the best voice agents Apply now: luma.com/ubu85bxv

4,160

Cartesia · May 28, 2026 · 4:50 PM UTC

Cartesia

@cartesia

May 28

Cartesia Ink-2 debuts as #1 for accuracy on the brand-new streaming speech-to-text leaderboard from @ArtificialAnlys! We designed Ink-2 from the ground up for voice agents - with low latency, eager transcripts, and semantic endpointing.

131

67,613

Cartesia · May 28, 2026 · 4:52 PM UTC

Cartesia

@cartesia

May 28

A great speech-to-text model for voice agents first and foremost needs to have high accuracy in production settings - this means noisy environments and conventionally difficult audio like silences, short transcripts, phone numbers, and UUIDs. For the conversation to be smooth, it also needs to have low latency with eager transcripts to reduce end to end response time. Finally, semantic endpointing with high accuracy is critical so they respond appropriately and don't interrupt the user.

2,220

Cartesia · May 28, 2026 · 4:54 PM UTC

Cartesia

@cartesia

May 28

We've built Ink-2 to excel on all of these axes in production. Give it a try on our website cartesia.ai/ink

Cartesia \ Ink

Real-time multimodal intelligence

cartesia.ai

1,860

Cartesia · May 26, 2026 · 8:48 PM UTC

Cartesia

@cartesia

May 26

Cartesia is excited to be the Voice that powers @avaturn_me's new Open Weights AVTR-1 avatar models. Check out the links, repo and docs below 👇

Avaturn

@avaturn_me

May 26

🚨 AVTR-1 New Model is OPEN WEIGHTS . Duplex Native , #1 on benchmarks. Here’s what being released. Links in comments - Model + Paper now on HF - Full Github repo to run it really fast Run it anywhere as low as $0. Comment, share, star on GH to get the word out

3,328

Cartesia · May 22, 2026 · 5:44 PM UTC

Cartesia

@cartesia

May 22

Sonic 3.5 is now the #1 text to speech model on the @ArtificialAnlys leaderboard! You no longer have to trade off quality and latency - Sonic 3.5 also has the fastest time to first audio at 82ms end to end. See full benchmark results 👇

Artificial Analysis

@ArtificialAnlys

May 22

Cartesia’s Sonic-3.5 takes the #1 spot on the Artificial Analysis Speech Arena Leaderboard, surpassing Inworld Realtime TTS 1.5 Max and Google’s Gemini 3.1 Flash TTS Sonic-3.5 is the latest TTS model from @cartesia . It supports 42 languages, including 9 Indian languages, with 500+ voices available out of the box. The model has been highly preferred among voters in the TTS Arena, with its demonstrated naturalness and accurate transcript following. Key takeaways: ➤ Quality: Sonic-3.5 has an Elo score of 1,218 (+16/-16) based on 1,144 arena appearances, placing it ahead of Inworld Realtime TTS 1.5 Max at 1,194 and Gemini 3.1 Flash TTS at 1,209 ➤ Pricing: Sonic-3.5 is priced at $39/1M characters, a premium compared to Gemini 3.1 Flash TTS at $18.3/1M characters, and Inworld Realtime TTS 1.5 Max at $35/1M characters ➤ Speed: 105.5 characters per second, compared to 205 characters per second for Inworld Realtime TTS 1.5 Max and 26.3 characters per second for Gemini 3.1 Flash TTS See more details and listen to samples below 🧵

11,675

Cartesia · May 21, 2026 · 5:25 PM UTC

Cartesia

@cartesia

May 21

NYC happy hour with @awscloud Thurs, June 4 · 5:30pm · NY tech week RSVP in comments 👇

2,466

Cartesia · May 21, 2026 · 5:26 PM UTC

Cartesia

@cartesia

May 21

RSVP: partiful.com/e/sp4Bjt8b6PEBp…

Voice AI After Hours: Cartesia x AWS Happy Hour - #NYTechWeek | Partiful

Cartesia and AWS are hosting NYC's voice AI community for a happy hour. Founders, engineers, researchers, anyone building with voice. Come hang. Drinks and bites on us. Free Cartesia credits for...

partiful.com

1,178

LiveKit · May 7, 2026 · 3:57 PM UTC

Cartesia retweeted

LiveKit

@livekit

May 7

Voice cloning is now available on LiveKit Inference. We’re launching with @inworld_ai and @cartesia. Clone a voice once and use it across multiple TTS providers, with automatic fallback to the same voice if a provider fails mid-call. Free to create and available on all paid plans today.

11,131

Timothy Luong (Chongz) · May 7, 2026 · 6:27 AM UTC

Cartesia retweeted

Timothy Luong (Chongz)

@chongz

May 7

Every day I curse @krandiash for tainting us, removing @cartesia’s purity as a neo lab in the pursuit of “revenue”

Deedy

@deedydas

May 7

The Ultimate List of Artificial Intelligence "Neolabs": May 2026. A Neolab is a pre-revenue scale startup working on long-term AI breakthroughs, usually with a $1B+ valuation. There are now 63 of them!

19,791

Cartesia · Mar 31, 2026 · 10:13 PM UTC

Cartesia

@cartesia

Mar 31

Nobody in voice AI is talking about TCPA compliance. Enterprise buyers ask about it first. @2xSolutions CEO Kevin DeMeritt processed $4B in business at Lear Capital - and built 2X Solutions around the TCPA compliance reality that most platforms ignore. Switching to Cartesia’s sub-100ms latency kept them well-within a two-second TCPA window to power millions of calls. Voice quality sealed the deal for live demos. 2X customers even ask to “talk to Mary" (Mary is the AI 🤖)

1,726

Cartesia · Mar 31, 2026 · 10:13 PM UTC

Cartesia

@cartesia

Mar 31

Cartesia \ 2X Solutions Builds Secure, TCPA-Compliant Voice AI with Cartesia

How a compliance-first voice AI platform uses sub-100ms latency to stay within TCPA guidelines across millions of calls

cartesia.ai

1,318

Cartesia · Mar 26, 2026 · 4:04 PM UTC

Cartesia

@cartesia

Mar 26

“There were moments where the Cartesia team was telling us things that were happening with our product before we even knew it,” said @fundamentoAI Co-Founder Vickram Saigal. “That gave us a lot of trust – we’ve got the right partners.” @fundamentoAI runs 20M+ monthly outbound calls for India’s largest lenders and insurers. Cartesia was 2x faster than any other provider they tested, and delivered true enterprise partnership.

1,219

Cartesia · Mar 26, 2026 · 4:04 PM UTC

Cartesia

@cartesia

Mar 26

Full story → go.cartesia.ai/fundamentox

Cartesia \ Fundamento Runs Millions of Financial Services Calls on Cartesia

How India's leading voice AI platform for lending and insurance drives a 42% lift in loan journey completions with Cartesia's speed and reliability

cartesia.ai

915

Cartesia · Mar 18, 2026 · 6:39 PM UTC

Cartesia

@cartesia

Mar 18

Mamba-3 is out! 🐍 SSMs marked a major advance for the efficiency of modern LLMs. Mamba-3 takes the next step, shaping SSMs for a world where AI workloads are increasingly dominated by inference. Read about it on the Cartesia blog: blog.cartesia.ai/p/mamba-3

Mamba-3: An Inference-First State Space Model | Cartesia Blog

Mamba-3 reorients state space models around modern inference workloads, improving quality while preserving the low-latency profile that makes linear models compelling.

inside.cartesia.ai

175

73,362

Cartesia · Mar 13, 2026 · 12:16 AM UTC

Cartesia

@cartesia

Mar 13

The world’s leading AI infrastructure platforms are converging on the same voice model 🔥 Excited to announce that Cartesia is now a dedicated model partner on @togethercompute's Voice Platform for the 450K+ teams and developers building on Together.

5,373

Cartesia · Mar 13, 2026 · 12:16 AM UTC

Cartesia

@cartesia

Mar 13

Cartesia \ Together AI Chooses Cartesia as Dedicated Model Partner for Enterprise Voice AI

As leading AI infrastructure platforms converge on Cartesia, Sonic comes natively to serve millions of audio minutes on Together AI’s Voice Platform

cartesia.ai

998