We’re excited to announce that Modular has entered an agreement to be acquired by @Qualcomm. The future of unified compute has never been stronger. Read the full announcement: modular.com/blog/qualcomm-to…
This is one of the most exciting moments in Modular's history.
Momentum is building fast, and there's a lot more ahead. We're putting it all in one place: ModCon.
ModCon is where you'll see what comes next. New capabilities across Modular Platform, MAX, and Mojo. The roadmap ahead. And what it all means for the engineers and teams running AI in production every day.
If you want to hear our biggest product announcements first, this is the place to be: August 18th in San Francisco.
Early-bird pricing closes July 1: modular.com/modcon?utm_sourc…
Sign up for updates on our AI Engineer World's Fair booth, including booth location, demo times, swag drops, and even a @clattner_llvm appearance 👀: luma.com/ai-engineer
Today, Modular announced an agreement to be acquired by @Qualcomm.
We founded Modular to build a unified compute platform for the world, and unlock a more open, efficient, and hardware-independent future for AI. With Qualcomm, we can bring that vision to more developers, enterprises, and hardware platforms— faster.
Modular co-founder and CEO @clattner_llvm explains our continued commitment to our mission: piped.video/watch?v=RvEwWlSM…
Modular is still open sourcing Mojo this year, right on track. We can’t wait to share the details at ModCon, August 18th in San Francisco: modular.com/modcon
We’re excited to announce that Modular has entered an agreement to be acquired by @Qualcomm. The future of unified compute has never been stronger. Read the full announcement: modular.com/blog/qualcomm-to…
SF, we're coming for you! Catch us at AI Engineer World's Fair, June 29-July 2 at Moscone West. @aiDotEngineer
We'll have MAX and Mojo running live at the booth, the whole team on the floor, and answers to your toughest inference questions.
Sign up for updates on our booth, live demos, swag, and more: luma.com/ai-engineer
Free 4-week Mojo course, taught by the team that built the language. See why Mojo is the ideal language for agentic development and diverse hardware.
July 9th to 30th, live on YouTube, and open to all!
Register: luma.com/mojo-101-session-1
The Modular community has been busy! Come see what people are building at today's community meeting:
• Max walks through his adventures porting OpenCL kernels to Mojo
• Yuhao presents StaMojo, a statistics library for Mojo
• Seyoon shares MojoR, a JIT compiler that maps R semantics into Mojo kernels
Join us at 10 AM PT via Zoom: luma.com/june-modular
AI is changing how we build software. Curiosity still matters.
@clattner_llvm, CEO and Co-founder of @Modular, shares his perspective on entering the industry today.
Modular 26.4 is out.
Today's release brings state-of-the-art Mixture-of-Experts serving to Modular Cloud, expands MAX support for the newest open-weight models, and takes another step toward Mojo 1.0.
Modular Cloud now supports the latest frontier models, including @MiniMax_AI's M3, @Zai_org's GLM 5.2, and @Kimi_Moonshot's Kimi 2.7.
26.4 also ships enhanced quantization and speculative decoding capabilities, extended Apple silicon GPU support, model bring-up via agent skills, and more.
Dive into all the changes: modular.com/blog/modular-26-…
.@Zai_org just launched GLM 5.2 with:
- Solid 1M Context: A solid 1M-token context that stably sustains long-horizon work
- Advanced Coding with Flexible Effort: Stronger coding capabilities with multiple thinking effort levels
- Improved Architecture: Reuses the same indexer across every four sparse attention layers, reducing per-token FLOPs by 2.9× at a 1M context length - Speculative decoding MTP: Increases acceptance length by up to 20%
- Fully Open: An MIT OSS license with no regional limits
Congrats to the Z.ai team on a fantastic launch!
GLM 5.2 is available on Modular Cloud, Day Zero:
modular.com/models/glm-5-2
.@zai_org open-sourced GLM 5.2 today, and Modular is a Day Zero launch partner.
GLM 5.2 is their new flagship for coding and long-horizon agentic work, with usable 1M-token context built for tasks that run long and call a lot of tools.
Serving a model like this well is a full-stack problem. As context grows, the KV cache grows with it, and doing it economically at high concurrency takes more than a config flag. The Modular stack optimizes the path from GPU kernels to serving, which lets us run frontier open models on Day 0 with the utilization and economics agent workloads need.
It's available on Modular Cloud now. Request access: console.modular.com/signup