Arthur Mensch · Dec 12, 2023 · 7:09 AM UTC

Arthur Mensch

Arthur Mensch

@arthurmensch

12 Dec 2023

Removed, enjoy !

Far El

@far__el

11 Dec 2023

So Mistral prohibits you from using their models to train or improve other models or compete against them........... I thought they were fully open...... mistral.ai/terms-of-use/

367

626

7,919

2,187,745

Arthur Mensch · Feb 26, 2024 · 2:23 PM UTC

Arthur Mensch

@arthurmensch

26 Feb 2024

We’re announcing a new optimised model today! Mistral Large has top-tier reasoning capacities, is multi-lingual by design, has native function calling capacities and a 32k model. The pre-trained model has 81.2% accuracy on MMLU. Learn more on mistral.ai/news/mistral-larg…. Mistral Large is available on la Plateforme, as well as on Azure, following our commitment to bring frontier AI everywhere. We're eager to see what developers do with Mistral Large!

Au Large | Mistral AI

The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.

mistral.ai

311

2,230

282,014

Arthur Mensch · Feb 9, 2025 · 8:22 AM UTC

Arthur Mensch

@arthurmensch

9 Feb 2025

Self-qualifying oneself as heavyweight while shipping nothing of significance looks like hubris to me

Christian Szegedy

@ChrSzegedy

8 Feb 2025

The AI race is very hard to enter at this point: even Mistral is a small player. The US has at least 7 heavyweights, China 3, the EU: 0. In the coming 1-3 years (as AIs become inreasingly capable) the amount of available compute will further gain on importance (self acceration). Even harder to catch up. Also I don't think it should be the givernment's role to push this directly: what hinders the EU is lack of VC money, overregulation and taxes.

1,919

271,845

Arthur Mensch · Jun 11, 2024 · 3:59 PM UTC

Arthur Mensch

@arthurmensch

11 Jun 2024

We are announcing €600M in Series B funding for our first anniversary. We are grateful to our new and existing investors for their continued confidence and support for our global expansion. This will accelerate our roadmap as we continue to bring frontier AI into everyone’s hands.

119

122

1,882

451,043

Arthur Mensch · Jan 31, 2024 · 4:55 PM UTC

Arthur Mensch

@arthurmensch

31 Jan 2024

An over-enthusiastic employee of one of our early access customers leaked a quantised (and watermarked) version of an old model we trained and distributed quite openly. To quickly start working with a few selected customers, we retrained this model from Llama 2 the minute we got access to our entire cluster — the pretraining finished on the day of Mistral 7B release. We've made good progress since — stay tuned!

159

1,567

645,865

Arthur Mensch · Sep 27, 2023 · 2:37 PM UTC

Arthur Mensch

@arthurmensch

27 Sep 2023

At @MistralAI we're releasing our very first model, the best 7B in town (outperforming Llama 13B on all metrics, and good at code), Apache 2.0. We believe in open models and we'll push them to the frontier mistral.ai/news/about-mistra… Very proud of the team !

Bringing open AI models to the frontier

Why we're building Mistral AI.

mistral.ai

187

1,508

288,307

Arthur Mensch · Jan 30, 2025 · 2:32 PM UTC

Arthur Mensch

@arthurmensch

30 Jan 2025

A new model to hasten AI progress. 24B, 81% MMLU, no RL for now! We're super excited to see the latest development in international open-source AI (kudos to Deepseek!), and cannot wait to bring new contributions to it. We're renewing our commitment to using Apache licenses. AI brings joy: accelerate.

Mistral AI

@MistralAI

30 Jan 2025

magnet:?xt=urn:btih:11f2d1ca613ccf5a5c60104db9f3babdfa2e6003&dn=Mistral-Small-3-Instruct&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce&tr=http%3A%2F%https://nitter.app/t.co/ua2yzvEYLu%3A1337%2Fannounce

105

127

1,515

167,788

Arthur Mensch · Nov 16, 2023 · 9:00 AM UTC

Arthur Mensch

@arthurmensch

16 Nov 2023

We have heard many extrapolations of Mistral AI’s position on the AI Act, so I’ll clarify. In its early form, the AI Act was a text about product safety. Product safety laws are beneficial to consumers. Poorly designed use of automated decision-making systems can cause significant damage in many areas. In healthcare, a diagnosis assistant based on a poorly trained prediction system poses risks to the patient. Product safety regulation should be proportional to the risk level of the use case: it is undesirable to regulate entertainment software in the same way as health applications. The original EU AI Act found a reasonable equilibrium in that respect. We firmly believe in hard laws for product safety matters; the many voluntary commitments we see today bear little value. This should remain the only focus of the AI Act. The EU AI Act now proposes to regulate “foundational models”, i.e. the engine behind some AI applications. We cannot regulate an engine devoid of usage. We don’t regulate the C language because one can use it to develop malware. Instead, we ban malware and strengthen network systems (we regulate usage). Foundational language models provide a higher level of abstraction than the C language for programming computer systems; nothing in their behaviour justifies a change in the regulatory framework. Enforcing AI product safety will naturally affect the way we develop foundational models. By requiring AI application providers to comply with specific rules, the regulator fosters healthy competition among foundation model providers. It incentivises them to develop models and tools (filters, affordances for aligning models to one's beliefs) that allow for the fast development of safe products. As a small company, we can bring innovation into this space — creating good models and designing appropriate control mechanisms for deploying AI applications is why we founded Mistral. Note that we will eventually supply AI products, and we will craft them for zealous product safety. With a regulation focusing on product safety, Europe would already have the most protective legislation globally for citizens and consumers. Any foundational model would be affected by second-order regulatory pressure as soon as they are exposed to consumers: to empower diagnostic assistants, entertaining chatbots, and knowledge explorers, foundational models should have controlled biases and outputs. Recent versions of the AI Act started to address ill-defined “systemic risks”. In essence, the computation of some linear transformations, based on a certain amount of calculation, is now considered dangerous. Discussions around that topic may occur, and we agree that they should accompany the progress of technology. At this stage, they are very philosophical – they anticipate exponential progress in the field, where physics (scaling laws!) predicts diminishing returns with scale and the need for new paradigms. Whatever the content of these discussions, they certainly do not pertain to regulation around product safety. Still, let’s assume they do and go down that path. The AI Act comes up with the worst taxonomy possible to address systemic risks. The current version has no set rules (beyond the term highly capable) to determine whether a model brings systemic risk and should face heavy or limited regulation. We have been arguing that the least absurd set of rules for determining the capabilities of a model is post-training evaluation (but again, applications should be the focus; it is unrealistic to cover all usages of an engine in a regulatory test), followed by compute threshold (model capabilities being loosely related to compute). In its current format, the EU AI Act establishes no decision criteria. For all its pitfalls, the US Executive Order bears at least the merit of clarity in relying on compute threshold. The intention of introducing a two-level regulation is virtuous. Its effect is catastrophic. As we understand it, introducing a threshold aims to create a free innovation space for small companies. Yet, it effectively solidifies the existence of two categories of companies: those with the right to scale, i.e., the incumbent that can afford to face heavy compliance requirements, and those that can’t because they lack an army of lawyers, i.e., the newcomers. This signals to everyone that only prominent existing actors can provide state-of-the-art solutions. Mechanistically, this is highly counterproductive to the rising European AI ecosystem. To be clear, we are not interested in benefiting from threshold effects: we play in the main league, we don’t need geographical protection, and we simply want rules that do not give an unfair advantage to incumbents (that all happen to be non-European). Transparency around technology development benefits safety and should be encouraged. Finally, we have been vocal about the benefits of open-sourcing AI technology. This is the best way to subject it to the most rigorous scrutiny. Providing model weights to the community (or even better, developing models in the open end-to-end, which is not something we do yet) should be well regarded by regulators, as it allows for more interpretable and steerable applications. A large community of users can much more efficiently identify the flaws of open models that can propagate to AI applications than an in-house team of red-teamers. Open models can then be corrected, making AI applications safer. The Linux kernel is today deemed safe because millions of eyes have reviewed its code in its 32 years of existence. Tomorrow’s AI systems will be safe because we’ll collectively work on making them controllable. The only validated way of working collectively on software is open-source development. Long prose, back to building!

332

1,330

789,363

Arthur Mensch · Dec 11, 2023 · 8:11 AM UTC

Arthur Mensch

@arthurmensch

11 Dec 2023

Announcing Mixtral 8x7B mistral.ai/news/mixtral-of-e… and our early developer platform mistral.ai/news/la-plateform…. Very proud of the team!

Mixtral of experts | Mistral AI

The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.

mistral.ai

187

1,330

231,007

Arthur Mensch · Oct 16, 2024 · 2:56 PM UTC

Arthur Mensch

@arthurmensch

16 Oct 2024

Introducing the world's best edge models. mistral.ai/news/ministraux/

Un Ministral, des Ministraux | Mistral AI

The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.

mistral.ai

118

1,202

136,286

Arthur Mensch · Feb 28, 2024 · 12:35 PM UTC

Arthur Mensch

@arthurmensch

28 Feb 2024

Clarifying a couple of things since we’re reading creative interpretations of our latest announcements: - We’re still committed to leading open-weight models! We ask for a little patience, 1.5k H100s only got us that far. - We have a reselling agreement with Microsoft, that we’re very excited about. Alongside similar partnerships, it will accelerate our growth. - Microsoft invested in a small convertible note alongside many other companies, as a distribution partner. We are an independent European company with global ambitions, that part is not changing either. We’re seeing some interest for Le Chat and Mistral Large, on both la Plateforme and Azure, and we’ll be iterating fast!

144

1,081

157,295

Arthur Mensch · Sep 9, 2025 · 7:33 AM UTC

Arthur Mensch

@arthurmensch

9 Sep 2025

We're back to school! Very proud of our team accomplishments, and honored to partner with @ASMLcompany in our next phase. We're very excited to push frontier AI capabilities in science and technology, with exciting releases ahead.

Mistral AI

@MistralAI

9 Sep 2025

We’ve raised €1.7B to accelerate technological progress with AI! This Series C funding round, led by @ASMLcompany, fuels Mistral AI scientific research to keep pushing the frontier of AI to tackle the most critical technological challenges faced by strategic industries.

1,106

117,524

Arthur Mensch · Mar 18, 2024 · 9:46 PM UTC

Arthur Mensch

@arthurmensch

18 Mar 2024

Congratulations, interesting how Github stars seem to correlate to superfluous parameters 😉

Igor Babuschkin

@ibab

18 Mar 2024

The Grok-1 repo is getting pretty popular. I will be responding to pull requests and issues. Feel free to contribute!

902

157,096

Arthur Mensch · Nov 1, 2023 · 7:09 AM UTC

Arthur Mensch

@arthurmensch

1 Nov 2023

It may soon be a crime to compress public domain human knowledge into public domain matrices. We need to regulate the usage of AI in applications, not gradient descent

150

969

380,491

Arthur Mensch · Jan 27, 2024 · 10:39 AM UTC

Arthur Mensch

@arthurmensch

27 Jan 2024

Mixtral is now powering Leo, Brave browser assistant!

Brave

@brave

25 Jan 2024

Today's update for Brave on desktop (v1.62) dramatically improves Leo, our privacy-preserving AI browser assistant. The most important upgrade is that we've changed the default LLM for Leo to the high-performing and open-source Mixtral 8x7B from @MistralAI for all users.

812

145,479

Arthur Mensch · Mar 6, 2025 · 10:51 PM UTC

Arthur Mensch

@arthurmensch

6 Mar 2025

Our new OCR model is available through our public API and as a self-deployable solution. In use on le Chat, and part of our specialist model family that is getting bigger !

Mistral AI

@MistralAI

6 Mar 2025

Introducing the world's best OCR model! mistral.ai/news/mistral-ocr

749

72,389

Arthur Mensch · Apr 17, 2024 · 2:01 PM UTC

Arthur Mensch

@arthurmensch

17 Apr 2024

Official now, very proud of the team! Apache 2.0 and instructed versions for your pleasure, available today on la Plateforme mistral.ai/news/mixtral-8x22…

Cheaper, Better, Faster, Stronger | Mistral AI

The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.

mistral.ai

656

63,652

Arthur Mensch · Feb 6, 2025 · 5:15 PM UTC

Arthur Mensch

@arthurmensch

6 Feb 2025

The team is fast! It's been super exciting to see le Chat more and more widely adopted. It's an early product, and we can't wait to show you what's coming next. mistral.ai/en/news/all-new-l…

The all new le Chat: Your AI assistant for life and work | Mistral AI

The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.

mistral.ai

Mistral AI

@MistralAI

6 Feb 2025

Le Chat is fast (1,100 tok/s for flash queries on an updated Mistral Large). Download it at mistral.ai/app/android or mistral.ai/app/ios

689

80,367

Arthur Mensch · Jul 18, 2025 · 7:17 PM UTC

Arthur Mensch

@arthurmensch

18 Jul 2025

Our first shot at audio

Mistral AI

@MistralAI

15 Jul 2025

Introducing the world's best (and open) speech recognition models!

648

67,691

Arthur Mensch · May 11, 2018 · 4:25 PM UTC

Arthur Mensch

@arthurmensch

11 May 2018

Our work w/ @mblondel_ml 'Differentiable Dynamic Programming for Structured Prediction and Attention' was accepted at @icmlconf ! arxiv.org/abs/1802.03676 Sparsity and backprop in CRF-like inference layers using max-smoothing, application in text + time series (NER, NMT, DTW)

189

627

Arthur Mensch · May 21, 2025 · 6:36 PM UTC

Arthur Mensch

@arthurmensch

21 May 2025

mistral.ai/news/devstral we’ve released an open-source model which is great at agentic coding tasks - by far Pareto optimal and a good prelude to what’s coming next

Devstral | Mistral AI

The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.

mistral.ai

625

38,921

Arthur Mensch · Aug 23, 2025 · 6:12 AM UTC

Arthur Mensch

@arthurmensch

23 Aug 2025

Complex matters slowly coming together — we actually got surprised ourselves

Mistral AI

@MistralAI

22 Aug 2025

Mistral Medium 3.1 just landed on @lmarena_ai leaderboard—punching way above its weight! 🏆 #1 in English (no Style Control) 🏆 2nd overall (no Style Control) 🏆 Top 3 in Coding & Long Queries 🏆 8th overall Small model. Big impact. Try it now on Le Chat and the API!

616

78,314

Arthur Mensch · Jul 18, 2024 · 2:51 PM UTC

Arthur Mensch

@arthurmensch

18 Jul 2024

Today, we're announcing Mistral NeMo, a tiny multilingual model, 128k context length, trained with quantization awareness in collaboration with the NVIDIA research team.

Mistral AI

@MistralAI

18 Jul 2024

mistral.ai/news/mistral-nemo…

603

52,373

Arthur Mensch · Jul 16, 2024 · 3:00 PM UTC

Arthur Mensch

@arthurmensch

16 Jul 2024

As the Olympic season reaches Paris, we express our respect to ancient Greeks by releasing two new research models, MathΣtral and Codestral Mamba.

Mistral AI

@MistralAI

16 Jul 2024

mistral.ai/news/mathstral/ mistral.ai/news/codestral-ma…

597

47,940

Arthur Mensch · Mar 17, 2025 · 6:17 PM UTC

Arthur Mensch

@arthurmensch

17 Mar 2025

Triangle of happiness!

Mistral AI

@MistralAI

17 Mar 2025

Introducing Mistral Small 3.1. Multimodal, Apache 2.0, outperforms Gemma 3 and GPT 4o-mini. mistral.ai/news/mistral-smal…

592

43,908

Arthur Mensch · Nov 18, 2024 · 5:44 PM UTC

Arthur Mensch

@arthurmensch

18 Nov 2024

Expanding from a science company to a science and product company was no easy task, and that release is a very significant milestone in our journey. We're looking forward to how you'll use le Chat, now a slightly more mature animal

Mistral AI

@MistralAI

18 Nov 2024

We're proud to introduce the next generation of le Chat. Search, PDF upload, coding, image generation, le Canevas... All in one place: chat.mistral.ai/ mistral.ai/news/mistral-chat…

583

67,248

Arthur Mensch · Jun 10, 2025 · 2:56 PM UTC

Arthur Mensch

@arthurmensch

10 Jun 2025

Reasoning with latency-optimized models is quite a UX game changer. Super proud of what the team has accomplished with this Magistral release! mistral.ai/news/magistral

539

33,217

Arthur Mensch · Nov 8, 2023 · 8:30 AM UTC

Arthur Mensch

@arthurmensch

8 Nov 2023

This person has put a Mistral 7B model into a stuffed parrot and displays Chinchilla Equation (2) on his torso. Does anyone have his number? This seems a little unsafe.

449

116,221

Arthur Mensch · Feb 26, 2024 · 2:24 PM UTC

Arthur Mensch

@arthurmensch

26 Feb 2024

As a small surprise, we’re also releasing le Chat Mistral, a front-end demonstration of what Mistral models can do. Learn more on mistral.ai/news/le-chat-mist…

Le Chat | Mistral AI

The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.

mistral.ai

445

83,869

Arthur Mensch · Jul 3, 2025 · 2:35 PM UTC

Arthur Mensch

@arthurmensch

3 Jul 2025

Mistral is proud to provide the text LLM powering Unmute, the open-source voice AI from @kyutai_labs!

kyutai @kyutai_labs

3 Jul 2025

Kyutai TTS and Unmute are now open source! The text-to-speech is natural, customizable, and fast: it can serve 32 users with a 350ms latency on a single L40S. Try it out and get started on the project page: kyutai.org/next/tts

460

41,714

Arthur Mensch · Mar 27, 2024 · 1:01 PM UTC

Arthur Mensch

@arthurmensch

27 Mar 2024

Welcome to the party

Jonathan Frankle

@jefrankle

27 Mar 2024

Meet DBRX, a new sota open llm from @databricks. It's a 132B MoE with 36B active params trained from scratch on 12T tokens. It sets a new bar on all the standard benchmarks, and - as an MoE - inference is blazingly fast. Simply put, it's the model your data has been waiting for.

440

87,996

Arthur Mensch · Mar 6, 2024 · 5:56 PM UTC

Arthur Mensch

@arthurmensch

6 Mar 2024

Our first hackathon! We’ll be in SF the week before to meet our users

Mistral AI for Developers

@MistralDevs

6 Mar 2024

Thrilled to announce the first @MistralAI Hackathon in San Francisco on March 23-24! Sign up at: partiful.com/e/Zk9c9HVsmtsGD… Keynote from Mistral AI founders: @arthurmensch & @GuillaumeLample. Mistral AI mentors: @dchaplot, @sandeep1337, @theo_gervet, @mjmj1oo, @sophiamyang

425

46,927

Arthur Mensch · May 29, 2024 · 2:12 PM UTC

Arthur Mensch

@arthurmensch

29 May 2024

With Codestral, our newest state-of-the-art code model, we are introducing the Mistral AI non-production license (MNPL). It allows developers to use our technology for non-commercial use and research. It ensures that every actor on the value chain builds successful businesses. mistral.ai/news/mistral-ai-n….

362

100,723

Arthur Mensch · Sep 17, 2024 · 4:47 PM UTC

Arthur Mensch

@arthurmensch

17 Sep 2024

Moving forward with a new Small, Pixtral on le Chat and la Plateforme, reduced prices across the board, and an API free tier!

Mistral AI

@MistralAI

17 Sep 2024

mistral.ai/news/september-24… 1/2

351

33,406

Arthur Mensch · Feb 27, 2024 · 10:41 PM UTC

Arthur Mensch

@arthurmensch

27 Feb 2024

Day 1 delivery as always :)

Perplexity

@perplexity_ai

27 Feb 2024

Mistral Large is now available to all Perplexity Pro users! Head to your settings page to set it as your default model or test drive it with our Rewrite feature. This model will be available on our mobile apps very soon. Stay tuned!

344

60,787

Arthur Mensch · Nov 2, 2023 · 7:49 PM UTC

Arthur Mensch

@arthurmensch

2 Nov 2023

Leaving the AI Safety Summit after some constructive discussions today and yesterday. I voiced how open-source was today the safest way to develop AI, putting this transformative technology under the highest level of scrutiny. With many others, we recalled the enormous opportunities that AI brings for better education, better healthcare, unlocking critical science problems, and making our jobs more rewarding. Transparency and fast information flow across different actors are needed to continue enabling these opportunities. Finally, we were many to stress how any new institutions measuring AI progress should be fully independent to avoid the pitfalls of regulatory capture. And it appears we have been heard! We must ensure that any such institution involves the entire world. More data is needed to understand the limitations of current models, and we look forward to engaging with the AI community to jointly agree on a scientific monitoring framework around new AI capabilities in the coming year.

330

141,065

Arthur Mensch · Nov 18, 2024 · 5:51 PM UTC

Arthur Mensch

@arthurmensch

18 Nov 2024

At Mistral, we've grown aware that to create the best AI experience, one needs to co-design models and product interfaces. Pixtral was trained with high-impact front-end applications in mind and is a good example of that.

Mistral AI

@MistralAI

18 Nov 2024

We also released Pixtral Large, a new SOTA vision model. mistral.ai/news/pixtral-larg…

323

43,902

Arthur Mensch · Feb 26, 2024 · 8:09 PM UTC

Arthur Mensch

@arthurmensch

26 Feb 2024

Replying to @far__el

It’s removed, we missed it in our final review — no joke of ours, just a lot of materials to get right !

306

34,534

Arthur Mensch · Sep 29, 2020 · 10:02 AM UTC

Arthur Mensch

@arthurmensch

29 Sep 2020

"Online Sinkhorn" refines an Optimal Transport distance between two continuous distributions, based on a stream of samples. Stochastic approximation again ! Joint work with @gabrielpeyre, to be presented (oral) at #NeurIPS2020. arxiv.org/abs/2003.01415 github.com/arthurmensch/onli…

274

Arthur Mensch · May 5, 2022 · 4:32 PM UTC

Arthur Mensch

@arthurmensch

5 May 2022

Flamingo does feel slightly conscious these days 🦩

276

Arthur Mensch · Apr 11, 2024 · 6:25 AM UTC

Arthur Mensch

@arthurmensch

11 Apr 2024

Replying to @xlr8harder @MistralAI

Apache 2.0 indeed

273

30,771

Arthur Mensch · Jun 19, 2024 · 9:35 AM UTC

Arthur Mensch

@arthurmensch

19 Jun 2024

Local Codestral generating #scikitlearn and Keras code to run a medical imaging prediction on the fly. I like that :)

Rohan Paul

@rohanpaul_ai

18 Jun 2024

Open Interpreter's new release looks great for locally running LLMs The tool lets LLMs run code (Python, Javascript, Shell, and more) locally. You can chat with Open Interpreter through a ChatGPT-like interface in your terminal by running $ interpreter after installing. Just do `pip install open-interpreter` and then just run `- interpreter --local` sets up fast, local LLMs. Congrats @hellokillian 👏

253

71,448

Arthur Mensch · May 7, 2025 · 3:22 PM UTC

Arthur Mensch

@arthurmensch

7 May 2025

Self deployable fully packaged frontier AI. If you’re a country you may want to look here instead, we can help!

Mistral AI

@MistralAI

7 May 2025

Introducing Le Chat Enterprise, the most customizable and secure agent-powered AI assistant for businesses, making AI a real leverage for competitiveness. - Integration with your company knowledge (starting with Gmail, Google Drive, Sharepoint…) - Ability to add frequently used documents for better-informed outputs - Enterprise-grade features: agents, coding assistant, web search, global news coverage… - Secure deployment: on-prem, in your cloud, or as a service.

251

18,590

Arthur Mensch · May 24, 2024 · 7:59 PM UTC

Arthur Mensch

@arthurmensch

24 May 2024

Very proud of the team, a first step towards making model customisation much simpler.

Mistral AI for Developers

@MistralDevs

24 May 2024

Announcing `mistral-finetune`, the official repo and guide on how to fine-tune Mistral open-source models: github.com/mistralai/mistral…

230

31,750

Arthur Mensch · Mar 5, 2024 · 6:51 PM UTC

Arthur Mensch

@arthurmensch

5 Mar 2024

Very excited to be bringing our models to Snowflake customers as part of this multi-year partnership. LLMs become all the more interesting when contextualised on data, and we’re eager to see developers create powerful applications combining Mistral models with the Data Cloud.

Snowflake

@Snowflake

5 Mar 2024

We’re excited to announce a global partnership to bring @MistralAI's most powerful language models directly to Snowflake customers in the Data Cloud. Learn more about how Snowflake users can leverage AI with their enterprise data: okt.to/skPbvy

242

36,848

Arthur Mensch · Apr 3, 2024 · 7:26 AM UTC

Arthur Mensch

@arthurmensch

3 Apr 2024

Very happy to partner with @awscloud to expose Mistral models on Amazon Bedrock, as we continue to bring our technology to every developer.

Amazon Web Services

@awscloud

3 Apr 2024

A colossal AI has arrived. Get large with @MistralAI. ☁️💥💻 Mistral Large is now on #AmazonBedrock. Make the most of your data with cutting-edge text generation, top-tier reasoning capabilities, & advanced language processing. #AWS #generativeAI 👉 go.aws/43GcD8V

240

51,663

Arthur Mensch · Nov 16, 2023 · 6:56 AM UTC

Arthur Mensch

@arthurmensch

16 Nov 2023

Mistral models will very soon be available as a service on Azure, thank you @satyanadella! We bring our technology where developers build.

Satya Nadella

@satyanadella

16 Nov 2023

Copilot will be the new UI for both the world's knowledge and your organization's knowledge, but most importantly, it will be your agent that helps you act on that knowledge. Here are highlights from my keynote today at #MSIgnite.

230

72,915

Arthur Mensch · May 7, 2025 · 3:21 PM UTC

Arthur Mensch

@arthurmensch

7 May 2025

Mistral Medium 2 was Miqu by the way

Mistral AI

@MistralAI

7 May 2025

Introducing Mistral Medium 3: our new multimodal model offering SOTA performance at 8X lower cost. - A new class of models that balances performance, cost, and deployability. - High performance in coding and function-calling. - Full enterprise capabilities, including hybrid or on-premises/in-VPC deployment, custom post-training, and seamless integration into enterprise tools and systems. Check out our blog to learn more:

235

31,217

Arthur Mensch · Jan 13, 2025 · 11:30 PM UTC

Arthur Mensch

@arthurmensch

13 Jan 2025

Codestral 25.01 is not only on top of the Copilot Arena leaderboard, it's also 2x faster than the first Codestral -- that matters a lot for code completion

Arena.ai

@arena

13 Jan 2025

Exciting news from @CopilotArena! The latest Codestral 25.01 release is now topping the Copilot Arena leaderboard (joint #1, +12 points over previous Codestral!). Congrats to @MistralAI🎆 Try out the new model today in the @CopilotArena VSCode extension.

231

31,442

Arthur Mensch · Mar 9, 2018 · 1:51 PM UTC

Arthur Mensch

@arthurmensch

9 Mar 2018

3 notebooks on @PyTorch : from optimization (autodiff basics) to learning (a 2-parameter MLP) to deep-learning (Fashion Mnist + learning rate tricks). Thanks @ogrisel @CharlesOllion for this collaboration ! github.com/m2dsupsdlclass/le…

220

Arthur Mensch · Mar 1, 2025 · 1:03 PM UTC

Arthur Mensch

@arthurmensch

1 Mar 2025

Notice periods are our biggest pain by far

Nando de Freitas

@NandoDF

25 Feb 2025

Two European startups @recraftai and @bfl_ml lead image generation in the world. The third place model, Imagen 3, was developed in London but under a Californian company. artificialanalysis.ai/text-t… Europe could lead in other AI products by supporting more entrepreneurship and competition. One issue is that American companies enforce 6 month to 1 year notice periods and non-competes in Europe, but don’t do it in California. A good example of this is @GoogleDeepMind. They force employees to sign these contracts or retaliate by preventing them from getting merit promotions. This is wrong - it’s time for the UK government ( @matthewclifford ) not to be seduced by Google and develop its own AI industry. @xai Grok 3 came to be because a few deepminders left Google under the protection of musk. People leaving DeepMind also led to the creation of @MistralAI - if it was easier to leave, imagine how much more competitive Europe could be. Eliminate notice periods and non-competes in Europe. It has become a question of national security.

228

49,369

Arthur Mensch · Mar 9, 2024 · 9:08 PM UTC

Arthur Mensch

@arthurmensch

9 Mar 2024

Welcome Marie!

Marie Pellat @m_pellat

8 Mar 2024

After nearly a decade at Google, I’m happy to share that I’m starting a new position at @MistralAI. If you are excited about joining a small company which has already had an outsized impact in the field, check out our roles, we are hiring!

209

45,320

Arthur Mensch · Nov 1, 2023 · 7:34 AM UTC

Arthur Mensch

@arthurmensch

1 Nov 2023

The safest way to make useful AI while mitigating misuse is to work on improving it in the open, with the highest level of scrutiny — as always in software. That’s why open-source is the one and only path toward AI safety.

211

151,038

Arthur Mensch · May 23, 2024 · 8:07 PM UTC

Arthur Mensch

@arthurmensch

23 May 2024

We are very excited to partner with Harvey to help build domain-specific models! Our platform's deployment flexibility and high customisation capabilities will help Harvey address the highly regulated legal industry. harvey.ai/blog/mistral-annou…

Mistral AI and Harvey Partnership

Announcing a new partnership with Mistral AI.

harvey.ai

200

40,646

Arthur Mensch · Jun 14, 2023 · 9:50 AM UTC

Arthur Mensch

@arthurmensch

14 Jun 2023

Totally thrilled to be alongside @GuillaumeLample and @tlacroix6 to create Mistral AI. A lot of work ahead of us!

Guillaume Lample @ NeurIPS 2024

@GuillaumeLample

14 Jun 2023

Life update: I recently left Meta, and we are starting Mistral.AI, a new AI company with @arthurmensch and @tlacroix6

198

47,863

Arthur Mensch · May 27, 2025 · 3:37 PM UTC

Arthur Mensch

@arthurmensch

27 May 2025

We shipped higher level apis for agent orchestration - MCP compatible, all server-side logic deployable privately.

Mistral AI

@MistralAI

27 May 2025

Introducing Agents API: your go-to tool for building tailored agents to solve complex real-world problems! mistral.ai/news/agents-api

199

18,074

Arthur Mensch · Jun 5, 2024 · 4:37 PM UTC

Arthur Mensch

@arthurmensch

5 Jun 2024

A first step towards faster and stronger differentiated models for your use cases. We've rethought fine-tuning to make it super easy to use, both at training and inference.

Mistral AI for Developers

@MistralDevs

5 Jun 2024

Announcing @MistralAI fine-tuning API! mistral.ai/news/customizatio…

196

55,575

Arthur Mensch · Sep 30, 2025 · 7:26 PM UTC

Arthur Mensch

@arthurmensch

30 Sep 2025

AI needs to be connected to the physical world, proud to be supporting !

Liam Fedus

@LiamFedus

30 Sep 2025

Today, @ekindogus and I are excited to introduce @periodiclabs. Our goal is to create an AI scientist. Science works by conjecturing how the world might be, running experiments, and learning from the results. Intelligence is necessary, but not sufficient. New knowledge is created when ideas are found to be consistent with reality. And so, at Periodic, we are building AI scientists and the autonomous laboratories for them to operate. Until now, scientific AI advances have come from models trained on the internet. But despite its vastness — it’s still finite (estimates are ~10T text tokens where one English word may be 1-2 tokens). And in recent years the best frontier AI models have fully exhausted it. Researchers seek better use of this data, but as any scientist knows: though re-reading a textbook may give new insights, they eventually need to try their idea to see if it holds. Autonomous labs are central to our strategy. They provide huge amounts of high-quality data (each experiment can produce GBs of data!) that exists nowhere else. They generate valuable negative results which are seldom published. But most importantly, they give our AI scientists the tools to act. We’re starting in the physical sciences. Technological progress is limited by our ability to design the physical world. We’re starting here because experiments have high signal-to-noise and are (relatively) fast, physical simulations effectively model many systems, but more broadly, physics is a verifiable environment. AI has progressed fastest in domains with data and verifiable results - for example, in math and code. Here, nature is the RL environment. One of our goals is to discover superconductors that work at higher temperatures than today's materials. Significant advances could help us create next-generation transportation and build power grids with minimal losses. But this is just one example — if we can automate materials design, we have the potential to accelerate Moore’s Law, space travel, and nuclear fusion. We’re also working to deploy our solutions with industry. As an example, we're helping a semiconductor manufacturer that is facing issues with heat dissipation on their chips. We’re training custom agents for their engineers and researchers to make sense of their experimental data in order to iterate faster. Our founding team co-created ChatGPT, DeepMind’s GNoME, OpenAI’s Operator (now Agent), the neural attention mechanism, MatterGen; have scaled autonomous physics labs; and have contributed to some of the most important materials discoveries of the last decade. We’ve come together to scale up and reimagine how science is done. We’re fortunate to be backed by investors who share our vision, including @a16z who led our $300M round, as well as @Felicis, DST Global, NVentures (NVIDIA’s venture capital arm), @Accel and individuals including @JeffBezos , @eladgil , @ericschmidt, and @JeffDean. Their support will help us grow our team, scale our labs, and develop the first generation of AI scientists.

207

36,060

Arthur Mensch · Jul 24, 2024 · 3:37 PM UTC

Arthur Mensch

@arthurmensch

24 Jul 2024

Largement sur le Pareto

Mistral AI

@MistralAI

24 Jul 2024

mistral.ai/news/mistral-larg…

182

28,944

Arthur Mensch · Sep 28, 2023 · 7:19 AM UTC

Arthur Mensch

@arthurmensch

28 Sep 2023

Mistral 7B is now in prod, nice work @perplexity_ai !

Perplexity

@perplexity_ai

28 Sep 2023

💥 Mistral 7B Instruct is available now. Try it free—labs.perplexity.ai

172

34,594

Arthur Mensch · May 16, 2019 · 4:06 AM UTC

Arthur Mensch

@arthurmensch

16 May 2019

"Geometric losses for distributional learning" w/ @mblondel_ml and @gabrielpeyre accepted @icmlconf. We derive a geometric softmax with reg. optimal transport and Fenchel duality. Accounts for cost bw/ classes, outputs discrete/*continuous* distributions. bit.ly/2HkBDKq

165

Arthur Mensch · May 13, 2019 · 12:03 AM UTC

Arthur Mensch

@arthurmensch

13 May 2019

To seriously compare two methods in deep learning (say, GANs) , you're looking at 2 methods * 5 random seeds * 5 learning rates * 2 day = 100 days of GPU usage, and that's a minimum. This is often beyond academic labs capacity, how do you cope?

161

Arthur Mensch · Feb 27, 2024 · 8:53 AM UTC

Arthur Mensch

@arthurmensch

27 Feb 2024

Replying to @izzyz

Still committing mistral.ai/ ;)

Frontier AI LLMs, assistants, agents, services | Mistral

The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.

mistral.ai

168

11,940

Arthur Mensch · Aug 7, 2024 · 6:28 PM UTC

Arthur Mensch

@arthurmensch

7 Aug 2024

New steps towards faster model customisation and application building

Mistral AI

@MistralAI

7 Aug 2024

mistral.ai/news/build-tweak-…

161

28,513

Arthur Mensch · Jun 13, 2019 · 9:50 PM UTC

Arthur Mensch

@arthurmensch

13 Jun 2019

(sorry for self-promotion) I will be presenting our work on a new geometric softmax with continuous output, based on OT and Fenchel duality, tonight 6:30 @icmlconf, poster #179. Joint work with @mblondel_ml and @gabrielpeyre

161

Arthur Mensch · Nov 7, 2024 · 9:15 PM UTC

Arthur Mensch

@arthurmensch

7 Nov 2024

New steps toward completing our AI platform - proud of the team!

Mistral AI

@MistralAI

7 Nov 2024

Moderation API - mistral.ai/news/mistral-mode… Batch API - mistral.ai/news/batch-api/

145

22,613

Arthur Mensch · Jan 20, 2025 · 5:41 PM UTC

Arthur Mensch

@arthurmensch

20 Jan 2025

Merci pour votre confiance @SebLecornu, construisons ensemble l'IA de Défense 🇫🇷

Sébastien Lecornu

@SebLecornu

20 Jan 2025

L'agence ministérielle pour l'IA de défense (AMIAD) va nouer prochainement un partenariat avec @MistralAI Notre futur supercalculateur classifié sera également accessible aux acteurs publics et aux entreprises qui veulent développer de l'IA de façon sécurisée.

141

13,957

Arthur Mensch · Nov 1, 2023 · 7:17 AM UTC

Arthur Mensch

@arthurmensch

1 Nov 2023

We need to hammer this home: the soon-to-become AGI of many is mere statistical knowledge compression. It’s great, it can help solving millions of problems in healthcare, in education, it can make the jobs of everybody more creativity-oriented.

126

8,371

Arthur Mensch · Feb 8, 2024 · 6:22 PM UTC

Arthur Mensch

@arthurmensch

8 Feb 2024

Very proud to be partnering with @Capgemini to help enterprises adopt our technology!

Aiman Ezzat @aiman_ezzat

8 Feb 2024

Happy to share the news of our latest partnership, announced today: leveraging Mistral AI’s LLM technology, @Capgemini aims to make #GenAI more accessible for enterprises looking to customize and deploy multiple use cases with a lower carbon footprint. bit.ly/4bsZovL

130

34,044

Arthur Mensch · May 19, 2025 · 8:54 PM UTC

Arthur Mensch

@arthurmensch

19 May 2025

Very proud of this massive scale partnership. We’re investing in infrastructure to bring AI strategic autonomy to the entire world

Emmanuel Macron

@EmmanuelMacron

19 May 2025

MGX, Bpifrance, Mistral AI et NVIDIA choisissent ensemble la France ! Ils vont développer en Île-de-France un campus IA ouvert, associant data centers, calcul de haute performance, éducation et recherche. La France compte tant de talents de l’IA !

132

10,679

Arthur Mensch · Jun 11, 2025 · 10:03 AM UTC

Arthur Mensch

@arthurmensch

11 Jun 2025

We purposely made it great at optimal transport as you may have guessed !

Lénaïc Chizat @LenaicChizat

11 Jun 2025

Just tested this model on a few challenging math questions and I found it very helpful. Magistral keeps doubting its answers ("wait, but...") & trying to improve them, which makes it great at exploring & exploiting knowledge from its train data (and it's fast). Congrats Mistral !

128

18,064

Arthur Mensch · Jul 19, 2022 · 7:27 PM UTC

Arthur Mensch

@arthurmensch

19 Jul 2022

We're presenting RETRO at 4:15pm @icmlconf with @borgeaud_s, and later today at the poster session. Add a retrieval DB to divide your model size by 10, don't miss out!

122

Arthur Mensch · Nov 22, 2023 · 11:03 PM UTC

Arthur Mensch

@arthurmensch

22 Nov 2023

Excited to partner with Cloudflare on low-latency generative AI!

Cloudflare

@Cloudflare

21 Nov 2023

Today we’re excited to announce that we’ve added the Mistral-7B-v0.1-instruct to Workers AI. Mistral 7B is a 7.3 billion parameter language model with a number of unique advantages. Try it today! cfl.re/47iSsyO

113

34,381

Arthur Mensch · Nov 28, 2022 · 11:34 PM UTC

Arthur Mensch

@arthurmensch

28 Nov 2022

At NeurIPS this week, reach out if you want to discuss our work around LLM at @DeepMind (Chinchilla, Retro, Flamingo), and if you're interested in working with us!

100

Arthur Mensch · Oct 15, 2023 · 6:52 AM UTC

Arthur Mensch

@arthurmensch

15 Oct 2023

We need less religion and more science to make AI safe and useful

martin_casado

@martin_casado

14 Oct 2023

Things we need to get past in the AI safety discussion to make progress: - Circular arguments / tautologies : AGI definitionally being the feared end goal is a substance free position. - Bad/incomplete inductive arguments : I've yet to find an inductive step that has any rigor it. - Bad analogy : the silicon stack isn't the carbon stack. AI is not a nuclear weapon. - Conflating replication with aggregate capability : Just because you get an exponential with replication does not mean you have exponential capability. To wit, 10 people with 150 IQ is not the same thing is 1 person with 1500 IQ. - Imputing exponential growth from a control system: Many (most?) control systems that get increasingly complex end up with diminishing marginal returns. Not the opposite. - Bad forms of Pascal's wager : If we actually operated this way, we'd all be every religion, and adopt every regulation for every hypothetical. - Claims with no evidentiary basis : recursive self-improvement of intelligence sounds good, but there is absolutely no evidence I can find that supports it can happen. - Appeal to experts : there are plenty on either side. Let's focus on the actual claims.

35,094

Arthur Mensch · Jul 11, 2020 · 10:26 AM UTC

Arthur Mensch

@arthurmensch

11 Jul 2020

Happy to present our work on games @icmlconf: Gradient extrapolation with alternated player update speeds-up equilibrium finding in convex games/GANs ! Joint work with S. Jelassi, C. Domingo, D. Scieur, @joanbruna @NYUDataScience. arxiv.org/pdf/1905.12363.pdf github.com/arthurmensch/dseg

Arthur Mensch · Feb 12, 2025 · 12:48 PM UTC

Arthur Mensch

@arthurmensch

12 Feb 2025

Thank you for your trust Chuck, we're extremely excited to be working with Cisco

Chuck Robbins

@ChuckRobbins

10 Feb 2025

Thrilled to announce our AI Renewals Agent, developed in partnership w/ @MistralAI. This is the first big step toward improving our customer experience with Agentic AI – big thanks to the teams driving this innovation! newsroom.cisco.com/c/r/newsr…

14,770

Arthur Mensch · Mar 21, 2025 · 8:47 AM UTC

Arthur Mensch

@arthurmensch

21 Mar 2025

I had the great pleasure to talk about sovereignty in AI with Jensen and @AnjneyMidha. We explain why every nation state needs an AI strategy and what matters for it to be successful. Full link below

19,657

Arthur Mensch · May 19, 2025 · 8:43 PM UTC

Arthur Mensch

@arthurmensch

19 May 2025

We’re bringing le Chat and our AI Platform to Dell’s hardware for on-prem deployment dell.com/en-us/blog/bringing…

5,841

Arthur Mensch · Nov 1, 2023 · 7:26 AM UTC

Arthur Mensch

@arthurmensch

1 Nov 2023

Can this change? Yes, although not without a paradigm change in the technology, and that’s why it’s great that we talk about risks. But right now the true risk is to mechanically leave the development of AI to 2 or 3 large corporations.

12,224

Arthur Mensch · Dec 19, 2023 · 3:49 PM UTC

Arthur Mensch

@arthurmensch

19 Dec 2023

Replying to @gneubig

Hello, thanks for this study! It seems you have been using a third-party model github.com/neulab/gemini-ben…… based on Mixtral base model. You'll probably get better results by working with our instructed version (mistral-small on our API). Let us know if we can help!

14,366

Arthur Mensch · Jan 28, 2025 · 9:30 PM UTC

Arthur Mensch

@arthurmensch

28 Jan 2025

DX improvements on la Plateforme!

Sophia Yang, Ph.D.

@sophiamyang

28 Jan 2025

Introducing @MistralAI Structured Outputs: Define your desired output format with @pydantic and ensure responses are structured exactly as specified🚀

15,429

Arthur Mensch · Mar 28, 2024 · 3:17 PM UTC

Arthur Mensch

@arthurmensch

28 Mar 2024

Answered some hard questions during our trip to SF. Thanks @jacobeffron and @jordan_segall for hosting!

Jordan Segall

@jordan_segall

28 Mar 2024

New Unsupervised Learning with @MistralAI CEO and Co-Founder @arthurmensch and @jacobeffron on: - Naming “Mistral” - How the LLM landscape will evolve - Mistral’s commercial strategy - Regulating AI safety YouTube: piped.video/_N2KPEdh69s Apple: apple.co/4cAWKof Spotify: spoti.fi/4cNRYUL

18,543

Arthur Mensch · Nov 1, 2023 · 7:19 AM UTC

Arthur Mensch

@arthurmensch

1 Nov 2023

But it’s not that hard to replicate. Anyone with 100M and the will to do it can create a rather good model from anywhere on earth. Good actor, or bad actor.

7,179

Arthur Mensch · Dec 6, 2017 · 12:38 AM UTC

Arthur Mensch

@arthurmensch

6 Dec 2017

We present our work on multi-task learning in fMRI at poster #151 #NIPS2017 this afternoon with @GaelVaroquaux @julienmairal @danilobzdok, feel free to come and discuss ! arxiv.org/abs/1710.11438

Arthur Mensch · Dec 8, 2021 · 4:38 PM UTC

Arthur Mensch

@arthurmensch

8 Dec 2021

Very happy to see the release of these works ! In particular, we push semi-parametric language models (a AR Transformer and a nearest-neighbor database) to an unprecedented scale, and obtain continuous improvements with DB size. The semi-parametric route holds many promises !

Google DeepMind

@GoogleDeepMind

8 Dec 2021

Replying to @GoogleDeepMind

The three studies explore: Gopher - a SOTA 280B parameter transformer, ethical and social risks, & a new retrieval architecture with better training efficiency. 1: dpmd.ai/llm-gopher 2: dpmd.ai/llm-ethics 3: dpmd.ai/llm-retrieval (more dpmd.ai/llm-retro) 2/

Arthur Mensch · Apr 11, 2024 · 9:03 PM UTC

Arthur Mensch

@arthurmensch

11 Apr 2024

Super excited by this partnership with @SnowflakeDB, with specialist models!

sridhar

@RamaswmySridhar

11 Apr 2024

Exciting to combine @SnowflakeDB 's SQL expertise with cutting edge AI from our friends at @MistralAI to create the world’s best SQL copilot!!

18,568

Arthur Mensch · Nov 1, 2023 · 7:23 AM UTC

Arthur Mensch

@arthurmensch

1 Nov 2023

The good news is, for all of the illegal usage of AI we can imagine (misinformation and knowledge search), AI currently does nothing to lift the actual bottleneck standing in the way (distribution of information, actual execution of what the LLM recommends doing, respectively)

86,786

Arthur Mensch · Jul 19, 2023 · 9:55 AM UTC

Arthur Mensch

@arthurmensch

19 Jul 2023

Great to see the release of Llama II, open-source LLMs are making good progress! Still a lot of room to improve OS models positioning on the efficiency/performance front — so that they eventually catch up with proprietary solutions. An interesting challenge 😇

16,146

Arthur Mensch · Nov 18, 2024 · 5:55 PM UTC

Arthur Mensch

@arthurmensch

18 Nov 2024

Replying to @grove0100

Canevas is a French word!

3,659

Arthur Mensch · Dec 7, 2020 · 8:52 AM UTC

Arthur Mensch

@arthurmensch

7 Dec 2020

"It directly follows from Koshi-Schwartz inequality", welcome to a subtitled #NeurIPS2020 conference that promises to be fun

Arthur Mensch · Oct 31, 2017 · 1:48 PM UTC

Arthur Mensch

@arthurmensch

31 Oct 2017

Our paper #nips2017: Multi-layer classif° and dropout regul° permit transfer learning between fMRI datasets. goo.gl/WrnWqb

Arthur Mensch · Dec 11, 2023 · 8:09 AM UTC

Arthur Mensch

@arthurmensch

11 Dec 2023

Replying to @xlr8harder @MistralAI

solved :)

4,722

Arthur Mensch · Jul 24, 2024 · 3:39 PM UTC

Arthur Mensch

@arthurmensch

24 Jul 2024

Replying to @julien_c

my number one priority

4,331

Arthur Mensch · Jul 11, 2018 · 7:54 AM UTC

Arthur Mensch

@arthurmensch

11 Jul 2018

I will be presenting this work at @icmlconf on Thursday morning, and at poster #48. Please come and discuss :-) (alpha) code online github.com/arthurmensch/didy…

GitHub - arthurmensch/didyprog: Differentiable Dynamic Programming

Differentiable Dynamic Programming. Contribute to arthurmensch/didyprog development by creating an account on GitHub.

github.com

Mathieu Blondel @mblondel_ml

10 Jul 2018

Smoothing the max operator in a dynamic program recursion induces a random walk on the computational graph. The expected path on that walk can be computed efficiently by backpropagation, which converges to backtracking as smoothing vanishes. arxiv.org/abs/1802.03676

Arthur Mensch · Sep 5, 2017 · 2:05 PM UTC

Arthur Mensch

@arthurmensch

5 Sep 2017

Our paper on faster stochastic matrix factorization to appear in IEEE TSP 😃 Convergence proofs, 10x speed-ups, code. goo.gl/F5Jgpe

Arthur Mensch · Jun 20, 2016 · 6:39 PM UTC

Arthur Mensch

@arthurmensch

20 Jun 2016

My slides #icml2016: online learning + random subsampling for fast matrix factorization. Thanks to the audience ! amensch.fr/docs/presentation…

Arthur Mensch · Dec 10, 2020 · 3:04 PM UTC

Arthur Mensch

@arthurmensch

10 Dec 2020

What an abusive subtitle system @NeurIPSConf 😇 I think there is a slight bias against the French accent, and towards French in general: the optimal transport distance (a national pride) was systematically dubbed the American transport distance !

Arthur Mensch · Apr 1, 2022 · 7:32 AM UTC

Arthur Mensch

@arthurmensch

1 Apr 2022

Large language models are too big: very happy to share this work which is a welcome finding for computational sobriety! If you're into training big models, consider a dataset collection specialist for your next hire, more text tokens are needed ;)

This Post is from an account that no longer exists.