Soniox · Jun 16, 2026 · 2:55 PM UTC

Soniox

Pinned Tweet

Soniox

@soniox_ai

Jun 16

Soniox v5 Real-Time is now available. Live speech AI is not batch transcription with lower latency. It has to turn raw, noisy, continuous audio into structured intelligence while people speak. What’s new: • Higher accuracy across 60+ languages • Completely reengineered speaker separation • Better spoken language identification • Higher-quality real-time translation across 3,600+ language pairs • Faster semantic endpointing for voice agents • Better alphanumeric recognition • More robust native context handling Built for voice agents, meetings, captions, translation, dictation, customer support, contact centers, and multilingual products. Read more: soniox.com/blog/soniox-v5-re…

107

3,539,375

Soniox · Jun 26, 2026 · 1:56 PM UTC

Soniox

@soniox_ai

Jun 26

Don’t trust STT benchmarks. Too clean. Too closed. Too English-heavy. Too easy to cherry-pick. Real speech is chaos: accents, noise, code-switching, interruptions, names, numbers, IDs, domain terms. So we built Soniox Compare STT. Open source. Raw output. Trust no one. Test everyone. soniox.com/compare

1,448,106

Soniox · Jun 25, 2026 · 4:50 PM UTC

Soniox

@soniox_ai

Jun 25

Telugu with the dialect intact, used to write actual stories. That's the entire point of building for 60+ languages instead of one. Thanks for this.

Manas Krishnakant @manaskrshnakant

Jun 25

Have tried a lot of transcription apps but @soniox_ai is the best. It has live transcription and translation facilities. I was surprised it transcribed Telugu so well even the dialect was transcribed properly. Thanks a ton @soniox_ai you made my writing stories easy.

267

Soniox · Jun 25, 2026 · 2:50 PM UTC

Soniox

@soniox_ai

Jun 25

Sub-250ms endpointing with Soniox v5 Real-Time. This is the next level of voice agent experience.

Klemen Simonic

@klemensimonic

Jun 25

Real-time voice AI breaks when end-of-turn detection is wrong. Manifone, a telecom and voice AI company in France, integrated Soniox v5 Real-Time endpointing into Manivox.ai and is now seeing endpoint finalization in under 250ms after the phrase ends. Fast turn-taking. Fewer false endpoints. Natural voice agents.

600

Soniox · Jun 23, 2026 · 9:35 AM UTC

Soniox

@soniox_ai

Jun 23

Voice AI cannot be English-first. ATLO, a startup in South Korea, is building AI companion apps, robots, meeting assistants, and smart home devices. They use Soniox extensively for real-world voice AI. Korean is not an edge case. Every language matters. The future of AI is global, multilingual, real-time, and voice-first. Thank you, Sunghyun Park and ATLO.

2,481

Soniox · Jun 22, 2026 · 5:55 PM UTC

Soniox

@soniox_ai

Jun 22

With Soniox v5 Real-Time you get a far more robust mid-sentence language switching and speaker diarization. Diarization also holds up on harder audio, when people talk over each other. All our models provide multilingual output by default, and speaker diarization can be turned on with a single parameter (enable_speaker_diarization: true) without any additional costs. Works across all 60+ languages.

569

Soniox · Jun 20, 2026 · 9:53 AM UTC

Soniox

@soniox_ai

Jun 20

With Soniox stt-rt-v5 model endpoint detection receives additional configuration controls. This allows users to fine-tune endpointing behavior to the needs of their implementation: - endpoint_latency_adjustment_level - endpoint_sensitivity - max_endpoint_delay_ms Read more on how they work together and how to tune them to specific use cases in our docs: soniox.com/docs/stt/rt/endpo…

Endpoint detection

Learn how real-time endpoint detection works and how to tune it for your application.

soniox.com

696

Soniox · Jun 17, 2026 · 8:25 PM UTC

Soniox

@soniox_ai

Jun 17

Soniox v5 Real-Time introduces endpoint_sensitivity. It adjusts how likely the model is to emit an endpoint. Higher values make endpoints more likely, finalizing segments sooner. Lower values make them less likely, so the system waits longer before finalizing. Tune it for fast voice agent turn-taking or for dictation and mid-sentence pausers. Learn more about it from endpoint detection docs: soniox.com/docs/stt/rt/endpo…

Endpoint detection

Learn how real-time endpoint detection works and how to tune it for your application.

soniox.com

400

Eirik Hoem · Jun 17, 2026 · 9:53 AM UTC

Soniox retweeted

Eirik Hoem @eirikhm

Jun 17

We use Soniox extensively at Tana, great stuff!

Soniox

@soniox_ai

Jun 16

298

Soniox · Jun 15, 2026 · 9:20 PM UTC

Soniox

@soniox_ai

Jun 15

Soniox is now a provider in @LiteLLM. We are always happy to see new community integrations. Thanks @dan2k3k4 (cc @amazeeio). docs.litellm.ai/release_note…

v1.89.0 - Claude Fable 5, A2A Agent Providers & MCP Per-Server Controls | liteLLM

Deploy this version

docs.litellm.ai

329

Soniox · Jun 14, 2026 · 9:10 PM UTC

Soniox

@soniox_ai

Jun 14

Speaker diarization is one of the hardest problems in speech AI. People interrupt, laugh, and talk at once. Acoustic-only systems break when voices sound alike or overlap. Soniox v5 Async uses the sound and the meaning together to figure out who said what, which leads to better separation in real-life conversations.

1,477

Soniox · Jun 13, 2026 · 5:55 PM UTC

Soniox

@soniox_ai

Jun 13

The easiest way to try Soniox Async v5 in your code: use our Python or Node SDK. Call transcribe_and_wait_with_tokens, wait, read the audio transcription from the result. Done.

474

Soniox · Jun 11, 2026 · 2:47 PM UTC

Soniox

@soniox_ai

Jun 11

Soniox v5 Async is live. Our new async speech-to-text model turns real-world audio into more accurate, structured speech data. What’s improved: • Higher accuracy across 60+ languages • Completely reengineered speaker separation for identifying who said what • Improved language identification for multilingual and accented speech • Better recognition and formatting of numbers, dates, emails, IDs, codes, names, and addresses • More robust context usage for names, domain vocabulary, product terms, and custom phrases stt-async-v5 is fully compatible with the existing async API. Just update the model name. Read more: soniox.com/blog/soniox-v5-as…

2,526,321

Soniox · Jun 10, 2026 · 3:31 PM UTC

Soniox

@soniox_ai

Jun 10

Google now has Gemini Live Translate.  Soniox has Real-World Live Translate.

756,197

Soniox · Jun 10, 2026 · 9:05 AM UTC

Soniox

@soniox_ai

Jun 10

Soniox shows its performance already on simple audio input. Once you throw in IDs, numbers, emails, addresses, and actual hard speech, the accuracy gap just grows bigger. A broken speech recognition layer makes the rest of the pipeline fall apart, and a laggy service amplifies it. Your voice agents deserve a speech system that does not fall apart.

中崎工房 | AIで1時間の仕事を5分で終わらせる人

@nakazakifam

Jun 9

話題のGemini 3.5 Live Translateを少し前に話題になった、GPT-Realtime-Translateと私が自作アプリで使っている圧倒的コスパのSonioxと比較テストしました。結論：GPT不安定、Geminiさすが、Sonioxすごい。ASRの速度と精度がこの中でいちばんに見える。ただ、私の声をマイク音声で音声アウトプットもパソコンのスピーカーからやったので全然本来の力を発揮できていない可能性もあります😅 また真面目な比較テストをしたいと思います。

1,161

Soniox · Jun 9, 2026 · 11:26 AM UTC

Soniox

@soniox_ai

Jun 9

Stop overpaying for speech AI. Compare your bill across providers with our new pricing calculator. soniox.com/compare#calculato…

3,220,077