The AI omnicloud

Palo Alto & SF, CA
Foundry is now Mithril - the AI omnicloud We're excited to share what this next chapter will bring
2
1
15
3,717
We're excited to announce $80M in seed and Series A funding co-led by @sequoia and @lightspeedvp to further our mission of orchestrating the world’s compute capacity, making it universally accessible and useful. How we can help 👇
3
7
123
154,095
Thank you so much to @Nasdaq and @Redpoint for choosing to celebrate our $80M in seed and Series A funding announcement 📣 on the @Nasdaq Times Square billboard 🚀 we had a blast and are beyond grateful 🙌 @jaredq_ @sigalitperelson @BorisHanin @adeeshjain @Amanda__Hecker @therealadeesh
2
5
35
7,110
We are honored to have @shaunmmaguire partner at @sequoia on our side since day 0 🙌 check out his blog post about partnering with us sequoiacap.com/article/partn…
1
7
27
7,752
Fantastic to see AI founders, researchers, and builders coming together last night. Thanks for hosting, @FactoryAI 🤝
4
20
7,836
Thank you to @ravirajjain and the entire @lightspeedvp team for partnering with us at the Series A 🙏 check out the latest blog post 👉
.@mlfoundry's distributed compute service simplifies AI model training with veterans from Google DeepMind and Meta. Maximizing the utility of the computing power we already have, and will produce, arrives at the perfect moment in our AI trajectory - and why we led their Series A. Learn more: lsvp.com/stories/scaling-ai-… Cc: @ravirajjain and @ravi_lsvp
6
20
5,733
Check us out if you want to be GPU rich @shaunmmaguire 🙌
Are you GPU poor and want to be GPU rich? 🤑 What if we said the real bottleneck right now isn't GPU supply but rather pathetic GPU utilization? ⚡️ @mlfoundry got to 8 figures+ of revenue while operating in stealth This cracked team led by @jaredq_ lives at the edge of AI
1
2
16
6,704
It’s not a GPU supply issue, it’s a utilization issue. You heard it here first. @YahooFinance
1
5
15
2,536
Thank you to all the students who joined us for our AI intern event last week at the @sequoia NYC rooftop! Amazing to meet the incredible talent coming into our industry 📈
1
17
3,054
Networks of networks (NoNs), which compose many inference calls to multiple monolithic AI models, can significantly improve system accuracy for certain domains, particularly those requiring advanced reasoning. How, though, to compose these calls? What principles can we use to guide the composition of NoNs? Read the full paper from our founder and CEO @jaredq_ and co-authors @BorisHanin, @ChenLingjiao, @pbailis, Ion Stoica, and @matei_zaharia 👇 Arxiv: arxiv.org/pdf/2407.16831
In this article, I and co-authors @BorisHanin, @ChenLingjiao, @pbailis, Ion Stoica, and @matei_zaharia explore one of the most powerful ideas we have yet discovered to inform compound AI systems design: verifiability. In common situations where practitioners are willing to expend a higher budget to go beyond the capabilities frontier accessible to today's state-of-the-art (SOTA) monolithic models, they may be willing to invoke many model inference calls, composing them into “networks of networks” (NoNs) of sorts. The question then becomes: what principles should guide the composition of these NoNs? Inspired by TCS and PCP notions that often verification is easier than generation (as holds for classical problems like graph coloring), we construct “best-of-K” or “judge-based” Compound AI Systems, which explicitly separate “generator” modules from “verifier” modules. We posit that these systems are particularly helpful for “reasoning-based” or “procedural-knowledge” oriented tasks, which are often more verifiable, less so for factual or declarative-knowledge settings (and we can use these systems partly to help characterize tasks, including subjects in the MMLU, along these lines). Very neatly, it turns out we can analytically characterize when these systems can confer a gain and predict the gain’s extent. We hope people will extend these ideas to tackle some of the reasoning-oriented application frontiers that are a bit beyond the range of today’s SOTA models. Arxiv: arxiv.org/pdf/2407.16831
4
9
1,727
We are honored to have our industry’s top technical and commercial minds along for the journey including @Redpoint, @M12vc, @w_conviction, @nea_ventures, and angels and advisors such as @JeffDean, @ericschmidt and @iendeavors, and @matei_zaharia, as we're contributing to the future of computing.
7
596
Elasticity: Need a sudden burst in GPU capacity? Experiencing an unforeseen usage spike? Foundry is built to scale with you.
1
7
2,321
Simplicity: We are making cluster management and workload orchestration tools accessible to everyone, not just industry giants.
1
7
2,150
Access to compute shouldn't be the limiting factor for researchers and engineers at the forefront of AI. Learn more about how we envision the future of the cloud for AI in this post from our Founder and CEO, @jaredq_ 👇 mlfoundry.com/blog/restoring…
1
6
1,782
Availability: We have capacity. @nvidia H100s, A100s, and more.
1
7
3,030
Foundry exists to shift the paradigm and trajectory of AI infrastructure, making it more accessible, efficient, and powerful.
6
1,747
Security and resiliency: We’ve built to the highest security and compliance standards from day one. We are SOC 2 Type II certified and work with enterprises of all sizes to predict and resolve errors proactively to maximize uptime.
1
6
2,033
Learn more about our vision and check out our first blog post by our founder and CEO, @jaredq_: mlfoundry.com/announcements/…
1
5
733
Burst spot instances. Spot instances are preemptible, giving you access to Foundry’s flexible buffer capacity. They can offer up to 20x better price-performance for more dynamic and horizontally scalable workloads.
1
1
169
Automate spot scaling and orchestration. Take full advantage of spot compute by placing orders via API, adding instances to a hosted K8s cluster to scale horizontally, creating scripts to gracefully handle startup and preemption, automatically mounting storage, and more.
1
1
469
Sell back what you don’t use. If you end up reserving more compute than you need or your workload ends early, you can relist any unused nodes on the spot market. Relisted nodes generate credits until you take them back or your original reservation ends.
1
1
210
Reserve GPU clusters for as little as 3 hours. Provision precisely the capacity you need to support critical training runs and time-sensitive development tasks. Reserve through the Foundry console; no procurement processes, no contracts.
1
1
329
@jaredq_ our founder and CEO on our latest announcement 🙌 check out our blog post 💪
We’re honored to announce our seed and Series A funding, and thankful to our partners and investors! With a mission of orchestrating the world’s compute capacity, making it universally accessible and useful, exciting things are on the horizon. Learn more about what's ahead: mlfoundry.com/announcements/…
1
105
Reserve only what you need. We’ll handle any hardware failures. If we detect that one of your reserved nodes is failing, we can often replace it proactively from our pool of healing buffer nodes. No need to overprovision to account for GPU failures.
1
1
232