Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders

Miami / New York / Paris
Hot take: Git was the wrong abstraction for 90% of ML data. Checkpoints, optimizer states, training logs, agent traces - none of this needs version control. It needs fast, cheap, mutable storage. So we built Buckets. S3-like storage on the @huggingface Hub with Xet dedup and zero egress. Train in a bucket. Publish to a repo. One platform. 🤗🤗🤗
82
78
1,088
150,508
Looks like we're going to welcome two more Hugging Faces to the family next year. My wife is a hero! 💛💛💛@ReginaSil2
1,472
858
36,760
1,182,579
I don’t follow @elonmusk and yet I get served dozens of his tweets in my timeline every day. Independently of politics, it feels like X has become his propaganda machine to be honest. Isn’t it the very thing he was trying to fight in the first place?
1,327
437
7,894
628,606
Once again, an AI system is not "thinking", it's "processing", "running predictions",... just like Google or computers do. Giving the false impression that technology systems are human is just cheap snake oil and marketing to fool you into thinking it's more clever than it is.
435
1,097
7,584
723,799
If you think @Apple is not doing much in AI, you're getting blindsided by the chatbot hype and not paying enough attention! They just released FastVLM and MobileCLIP2 on @huggingface. The models are up to 85x faster and 3.4x smaller than previous work, enabling real-time vision language model (VLM) applications! It can even do live video captioning 100% locally in your browser 🤯🤯🤯
229
575
6,406
854,082
Great research on open-source by @Harvard: - $4.15B invested in open-source generates $8.8T of value for companies (aka $1 invested in open-source = $2,000 of value created) - Companies would need to spend 3.5 times more on software than they currently do if OSS did not exist I suspect that these numbers and impact are even greater for AI than for software (would be great to study!)
135
1,391
5,734
792,741
Just 10 days after o1's public debut, we’re thrilled to unveil the open-source version of the groundbreaking technique behind its success: scaling test-time compute 🧠💡 By giving models more "time to think," LLaMA 1B outperforms LLaMA 8B in math—beating a model 8x its size. The full recipe is open-source🤯 This is the power of open science and open-source AI! 🌍✨
114
606
4,475
496,946
I would pick @ylecun over @elonmusk every single day of the week. Despite getting much less $$, recognition & visibility than entrepreneurs, the scientists who publish their groundbreaking research openly are the cornerstone of technological progress & massively contribute to making the world a better place!
374
254
3,968
538,107
Our science team has started working on fully reproducing and open-sourcing R1 including training data, training scripts,... Full power of open source AI so that everyone all over the world can take advantage of AI progress! Will help debunk some myths I’m sure too. Thanks @deepseek_ai!
118
507
4,040
347,349
lol choose the odd one out
Tech tycoons with a combined net worth of roughly $550 billion gathered in the same room Wednesday for a Senate forum on the future and regulation of AI bloom.bg/3RlmAV3
151
84
3,519
1,055,995
Should we acquire Stability and open-source SD3?
402
188
3,206
349,915
I think India could become an AI superpower and we're starting to see early signs!
182
208
2,708
261,269
This was the most important wedding of the weekend! No crutches or lightnings could stop us. Can’t wait to spend the rest of my life with my cutie pie!
348
29
2,702
159,015
When @sama told me at the AI summit in Paris that they were serious about releasing open-source models & asked what would be useful, I couldn’t believe it. But six months of collaboration later, here it is: Welcome to OSS-GPT on @huggingface! It comes in two sizes, for both maximum reasoning capabilities & on-device, cheaper, faster option, all apache 2.0. It’s integrated with our inference partners that power the official demo. This open-source release is critically important & timely, because as @WhiteHouse emphasized in the US Action plan, we need stronger American open-source AI foundations. And who could do that better than the very startup that has been pioneering and leading the field in so many ways. Feels like a plot twist. Feels like a comeback. Feels like the beginning of something big, let’s go open-source AI 🔥🔥🔥
87
245
2,669
168,748
This isn’t a goal of ours because we have plenty of money in the bank but quite excited to see that @huggingface is profitable these days, with 220 team members and most of our platform being free (like model hosting) and open-source for the community! Especially noteworthy at a time when most AI startups wouldn’t survive a year or two without VC money. Great job team!
147
151
2,372
443,873
Six predictions for AI in 2024: - A hyped AI company will go bankrupt or get acquired for a ridiculously low price - Open-source LLMs will reach the level of the best closed-source LLMs - Big breakthroughs in AI for video, time-series, biology and chemistry - We will talk much more about the cost (monetary and environmental) of AI - A popular media will be mostly AI-generated - 10 millions AI builders on Hugging Face leading to no increase of unemployment
163
277
2,260
912,145
Super excited to welcome our new investors @SalesforceVC, @Google, @amazon, @nvidia, @AMD, @intel, @QualcommVenture, @IBM & @sound_ventures_ who all participated in @huggingface’s $235M series D at a $4.5B valuation to celebrate the crossing of 1,000,000 models, datasets and apps on the platform. These partners alone already shared over 1,000 open models and datasets and have over 10,000 users on Hugging Face. It takes a village to democratize good machine learning thanks to open-source and we’re just getting started! 🤗🎉🚀
252
296
2,161
813,419
1T parameters, open-weights, just released on @huggingface!
59
174
2,165
302,892
Every tech company can and should train their own deepseek R1, Llama or GPT5, just like every tech company writes their own code (and AI is no more than software 2.0). This is why we're releasing the Ultra-Scale Playbook. 200 pages to master: - 5D parallelism (DP, TP, PP, EP, FSDP) - ZeRO - Flash Attention - Compute/communication overlap and bottlenecks All with accessible theory intros and 4,000+ scaling experiments.
48
268
2,110
169,902
This is my 5-minute testimony before the US Congress! Open science and open source AI distribute economic gains by enabling hundreds of thousands of small companies and startups to build with AI. It fosters innovation, and fair competition between all. Thanks to ethical openness, it creates a safer path for development of artificial intelligence by giving civil society, non-profits, academia, and policy makers the capabilities they need to counterbalance the power of big private companies. Open science and open source AI prevent blackbox systems, make companies more accountable, and help solving today’s challenges like mitigating biases, reducing misinformation, promoting copyright, & rewarding all stake-holders including artists & content creators in the value creation process. Let's go!
79
358
1,948
616,405
Am I wrong in sensing a paradigm shift in AI? Feels like we’re moving from a world obsessed with generalist LLM APIs to one where more and more companies are training, optimizing, and running their own models built on open source (especially smaller, specialized ones) Some validating signs just in the past few weeks: - @karpathy released nanochat to train models in just a few lines of code - @thinkymachines launched a fine-tuning product - rising popularity of @vllm_project, @sgl_project, @PrimeIntellect, Loras, trl,... - 1M new repos on HF in the past 90 days (including the first open-source LLMs from @OpenAI) And now, @nvidia just announced DGX Spark, powerful enough for everyone to fine-tune their own models at home. Would you agree, or am I just seeing the future I want to exist? Also, why is this happening (just the advent of RL/post-training?)
149
213
2,025
384,858
Where’s the open-source @elonmusk ?
110
132
1,882
122,808
I believe we need open-source alternatives to ChatGPT for more transparency, inclusivity, accountability and distribution of power. Excited to introduce HuggingChat, an open-source early prototype interface, powered by OpenAssistant, a model that was released a few weeks ago.
55
382
1,832
235,797
Hanging out with the Paris @huggingface robotics team! Let’s make robotics open-source and community-driven
44
141
1,795
99,157
Hugging Face has quietly become the biggest AI app store with 400,000 total apps, 2,000 new apps created every day, getting visited 2.5M times every week! Now you can search through any of them with AI or categories. The future of AI will be distributed, have fun everyone!
95
244
1,781
103,759
My prediction: in 2024, most companies will realize that smaller, cheaper, more specialized models make more sense for 99% of AI use-cases. The current market & usage is fooled by companies sponsoring the cost of training and running big models (especially with cloud incentives).
The cost of running AI is becoming an issue, even in domains where people are prepared to pay for it: "Report: GitHub Copilot Loses an Average of $20 Per User Per Month" thurrott.com/cloud/290661/re…
66
251
1,677
660,186
The @LeRobotHF team is studying the @UnitreeRobotics G1 today in case you have any questions or fun stuff you want us to try!
162
147
1,746
172,300
We just released the best 3B model, 100% open-source, open dataset, architecture details, exact data mixtures and full training recipe including pre-training, mid-training, post-training, and synthetic data generation for everyone to train their own. Let's go open-source AI!
Introducing SmolLM3: a strong, smol reasoner! > SoTA 3B model > dual mode reasoning (think/no_think) > long context, up to 128k > multilingual: en, fr, es, de, it, pt > fully open source (data, code, recipes) huggingface.co/blog/smollm3
42
236
1,715
196,448
No excuse anymore not to train your own models! This 200+ pages with full transparency. Let's go open-source AI!
Training LLMs end to end is hard. Very excited to share our new blog (book?) that cover the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably huggingface.co/spaces/Huggin…
37
164
1,703
246,907
When you realize that open-source is at the frontier of AI despite: - less GPUs - less money - less public and policy support - no $100M salaries to attract talent - with closed-source taking advantage and copying all the innovations of open-source without contributing back theirs 🤯🤯🤯 And we’re just getting started!
84
155
1,647
175,420
Everyone is talking about how we need more AI data centers (especially the ones who would mostly benefit from them) but why is no one talking about on-device AI? Running AI on your device: - Free - Faster & takes advantage of existing hardware - 100% privacy and control (you don’t send your data to an API)
186
199
1,632
230,413
When you run AI on your device, it is more efficient and less big brother and free! So it's very cool to see the new llama.cpp UI, a chatgpt-like app that fully runs on your laptop without needing wifi or sending any data external to any API. It supports: - 150,000+ GGUF models - Drop in PDFs, images, or text documents - Branch and edit conversations anytime - Parallel chats and image processing - Math and code rendering - Constrained generation with JSON schema supported Well done @ggerganov and team!
49
169
1,663
150,681
Unsurprisingly, Kimi K2 Thinking is already number one trending on HF. The AI frontier is open-source!
47
150
1,638
308,855
The main breakthrough of GPT-5 was to route your messages between a couple of different models to give you the best, cheapest & fastest answer possible. This is cool but imagine if you could do this not only for a couple of models but hundreds of them, big and small, fast and slow, in any language or specialized for any task - all at inference time. This is what we're introducing with HuggingChat Omni, powered by over 100 open-source models including gpt-oss, deepseek, qwen, kimi, smolLM, gemma, aya and many more already! And this is just the beginning as there are over 2 millions open models not only for text but image, audio, video, biology, chemistry, time-series and more on @huggingface!
116
158
1,657
268,807
Now you can interact with 100k+ open-source models - including Stable Diffusion, bioGPT, Flan, Bloom,... - and your own private models, in JS! Let's build AI better together!
33
219
1,593
321,726
Meta starts open-sourcing a lot and is now becoming one of the best companies in the world at shipping AI features. Coincidence? I don’t think so. Contrary to popular belief, a company (or a country) sharing their research, models and datasets publicly in open-source makes them MORE competitive, not LESS, even more so in AI. IMO, that’s how the US and some companies like OAI & Google established their leadership in the past few years (even though they are not so open anymore). Some of the reasons why open-sourcing makes companies more competitive: - Open science and open source attracts and motivates the best talents who want to to contribute to the field - It focuses organization on the speed of building - not on taking advantage of the current tech - especially important on a fast moving domain like AI - It motivates the whole field to improve what you’re building on (bug fixing, optimization, new capabilities) that you can then really easily integrate in your products). Is your company sharing their research, models and datasets? If not, they’re missing out! Source: wired.com/story/meta-ai-chat…
49
227
1,566
923,107
False! Because it's open-source, fast and cheap, it will help create thousands of silicon valley startups, hundreds of thousands of jobs, massively step up competition and create hundreds of billions of value for silicon valley. IMO more positively impactful than any and all closed-source AI startups (and it should be a wake-up call for these startups to share more!). vox.com/technology/397330/de…
89
169
1,542
83,464
Hugging Face is becoming the best place to share the most viral AI apps with spaces. Kolors Virtual Try-on just crossed 6,000,000 unique visitors & is now the #5 most popular space. Congrats to the Kwai Kolors team! huggingface.co/spaces/Kwai-K…
30
126
1,573
187,868
June 2024. Happened faster than I thought
53
131
1,514
69,962
Similarly to @Github copilot, you can now do question-answering in Google Sheet thanks to the TAPAS model from @GoogleAI & the @huggingface inference API. Machine Learning making its way in each and every product! Great job @osanseviero!
14
320
1,498
If AI stays closed-source, proprietary and monopolistic like it is now, it will destroy lots of jobs and just make the richest companies richer and more powerful! If we open it up thanks to open science and open-source, foster competition and decentralization of value and control, it will create many more jobs and economic value than it destroys. Let’s go!
165
183
1,500
154,020
Open-source superintelligence for all!
120
135
1,503
152,579
We're open-sourcing "The Amazing Hand", an eight-degree of freedom humanoid robot hand compatible with @lerobot that can be 3-D printed at home for less than $250 ✌️✌️✌️ Given the success of Reachy Mini (2,000+ robots sold in a few days), we won't have the bandwidth to manufacture this one ourselves but we release the bill of materials, the CAD files and assembly guides for everyone to build or sell their own, let's go open-source AI robotics!
58
258
1,470
210,010
.@nvidia is not holding back!
34
67
1,392
190,587
It's been released just a few days ago and already more than 500 derivative models of @deepseek_ai have been created all over the world on @huggingface with 2.5 million downloads (5x the original weights). The power of decentralized open-source AI!
49
218
1,333
228,459
Let's make AI robotics open-source!
85
165
1,367
183,492
Deepseek R1 just crossed 10,000 likes on HF and is now the most liked model amongst almost 1.5M public models. Mind-blowing!
33
120
1,312
88,881
I don't get why so many people are dunking on Meta. Building frontier AI models is freaking hard & they have consistently released 10 times more openly than others which is game-changer for the field. What about all the other big tech & startups with much more AI ressources that are releasing much less openly? Should we name names?
108
79
1,349
119,669
Deepseek R1 just became the most liked model ever on @huggingface just a few weeks after release - with thousands of variants downloaded over 10 million times now!
54
184
1,324
123,948
Am I the only one tired of LLM releases to gain 5% of accuracy? Time for audio, video, time-series, medical, biology, chemistry AI releases to get more of the spotlight please!
51
141
1,298
87,487
Apple is back! 20 new coreML models for on-device AI & 4 new datasets just dropped on HF: huggingface.co/apple
17
203
1,305
185,943
We just crossed 1,000,000 free public models on Hugging Face! That’s the ones the media covers like Llama, Gemma, Phi, Flux, Mistral, Phi, Starcoder, Qwen, Stable diffusion, Grok, Whisper, Olmo, Command, Zephyr, OpenELM, Jamba, Yi but also 999,984 others. Why? Because contrary to the “1 model to rule them all” fallacy, smaller specialized customized optimized models for your use-case, your domain, your language, your hardware and generally your constraints are better. As a matter of fact, something that few people realize is that there are almost as many models on Hugging Face that are private only to one organization - for companies to build AI privately, specifically for their use-cases. Today a new repository (model, dataset or space) is created every 10 seconds on HF. Ultimately, there’s going to be as many models as code repositories and we’ll be here for it! Cheers to the community!
61
175
1,277
182,494
Is @elonmusk replicating the OpenAI playbook? From open-source (grok 1) to closed-source (grok 2) From pro startups to regulatory capture If you can’t beat them, join them I guess haha
143
75
1,208
194,894
The @huggingface team waiting for @OpenAI to release their open-source model
45
64
1,216
94,506
One week later...
India's biggest AI startup, $1B Sarvam, just launched its flagship LLM. It's a 24B Mistral small post trained on Indic data with a mere 23 downloads 2 days after launch. In contrast, 2 Korean college trained an open-source model that did ~200k last month. Embarrassing.
32
63
1,224
117,638
Machine Learning is becoming the default way to build technology, mostly thanks to open-source & open-science but I don’t think we’re anywhere close to singularity/AGI/terminator/AI God (if it even exists) (1/5)
49
130
1,111
Building a startup in ML these days
✌🏼 find me on Threads @thatalliemason
20
101
1,080
NLP is going to be the most transformational tech of the decade!
37
158
1,136
Kimi K2 Thinking feels like a big milestone for open-source AI. The first time in a while that open-source gets ahead of proprietary APIs on their big area of focus (agents). Fun to see that it's happening at a time when the proprietary APIs have the most money/attention they've ever had (narrative violation to "they will win because they have more money/compute"). The paradigm shift continues!
Am I wrong in sensing a paradigm shift in AI? Feels like we’re moving from a world obsessed with generalist LLM APIs to one where more and more companies are training, optimizing, and running their own models built on open source (especially smaller, specialized ones) Some validating signs just in the past few weeks: - @karpathy released nanochat to train models in just a few lines of code - @thinkymachines launched a fine-tuning product - rising popularity of @vllm_project, @sgl_project, @PrimeIntellect, Loras, trl,... - 1M new repos on HF in the past 90 days (including the first open-source LLMs from @OpenAI) And now, @nvidia just announced DGX Spark, powerful enough for everyone to fine-tune their own models at home. Would you agree, or am I just seeing the future I want to exist? Also, why is this happening (just the advent of RL/post-training?)
32
92
1,164
136,483
Did you notice how most video AIs are commercial, closed-source & secret? It sucks and a big reason for the lack of open-source & transparency is that there are very few high-quality open video datasets. The @huggingface team is changing this today by releasing FineVideo, a high-quality annotated video dataset with a creative commons license. Time to build!
51
143
1,133
105,420
GPU-Poor no more: super excited to officially release ZeroGPU in beta today. Congrats @victormustar & team for the release! In the past few months, the open-source AI community has been thriving. Not only Meta but also Apple, NVIDIA, Bytedance, Snowflake, Databricks, Microsoft, Google, and more have released open models and datasets on Hugging Face, which now hosts over 1M models on the Hub which have been downloaded over a billion times. More than that, many are starting to be better than proprietary APIs. This movement has been supported not only by big tech but also by a thriving open-source AI community that includes academic labs, startups, and independent hobbyists. For example, more than 35,000 variation models of Llama have been shared on Hugging Face since Meta’s first version a year ago—including more than 7,000 based on Llama-3—ranging from quantized and merged models to specialized models in biology and Mandarin, to name a few. More than 4 million AI builders are now using Hugging Face. However, the open-source community doesn’t have the same resources available to train and demo these models that big tech have at their disposal, which is why ChatGPT remains the most used AI application today. @huggingface is fighting this by launching ZeroGPU, a shared infrastructure for indie and academic AI builders to run AI demos on Spaces, giving them the freedom to pursue their work without the financial burden of compute costs. Spaces have been the most popular way to build AI demos, with over 300,000 AI demos created so far on CPU or paid GPU (and a thousand more every day). To foster the continued development of the AI ecosystem, Hugging Face is committing $10M of free GPUs with the launch today of ZeroGPU. Technically speaking, ZeroGPU leverages Hugging Face's experience in hosting and serving more than 100 Petabytes monthly from the Hugging Face Hub. ZeroGPU allows Spaces to run on multiple GPUs by making Spaces efficiently hold and release GPUs as needed (as opposed to a classical GPU Space that holds exactly one GPU at any time). This architecture is also more energy-efficient since GPUs are shared rather than duplicated. ZeroGPU uses @nvidia A100 GPU devices under the hood. You can learn more about ZeroGPU here: huggingface.co/zero-gpu-expl… More than 1,300 ZeroGPU spaces have been built since we started giving early access to AI builders on May 1, 2024: huggingface.co/spaces/enzost… You can explore some examples from @victormustar: nitter.app/victormustar/sta… You can find the article from @kyliebytes: theverge.com/2024/5/16/24156… 🤗🤗🤗
ZeroGPU is the real deal for AI applications. ⬇️ Listing of some concrete apps that anyone can use right now (with direct links) ⬇️
65
221
1,073
275,687
We're 250 team members 😅😅😅
76
45
1,117
81,471
For me, Google is the #1 AI company in the world and kickstarted the whole revolution with "Attention is all you need", BERT, T5 and many more. I'm ecstatic to be able to collaborate with @sundarpichai @ThomasOrTK @JeffDean @fchollet and the whole Google org to democratize AI thanks to open-source AI and open science! Let's go!
36
149
1,094
245,600
IMO, if GPT 4.5 was released as an open-source base model (that everyone can distill), it would be the most impactful release of the year. As an API, it's meh...
43
55
1,100
97,343
No cloud, no cost, no data sent to anyone, no problem. Welcome to local AI on Hugging Face!
39
142
1,064
149,594
For years, we've been saying that bigger isn't always better for AI and that smaller specialized models are usually faster, cheaper and more accurate for your specific constraints. So super happy to release the long-overdue capability of finding the best model based on size on @huggingface! It took us a bit more time than expected to release because we needed to wait for safetensors & GGUF to become ubiquitous to do it easily but it's now live. Time for you to stop listening to AI influencers and find the best model (small or large) for you and your use-case!
44
115
1,094
97,992
Give us a billion more and we’ll add an emoji in there 😅🤗
21
15
1,028
154,528
Trump just suspended the visa program that allowed me to move to the US to start @huggingface! Unfortunately, I won’t be able to vote in a few months but if you can, please vote him out, he's destroying what made America great in so many different ways! axios.com/immigration-ban-h1…
16
141
1,038
I'd like to live in a world where all countries & organizations, big or small, can train their own AI, instead of just a few of the richest & biggest ones. This is IMO one of the most important topics ever and will shape the future of the world. That's why I'm super proud when we release things like the ultra-scale playbook. We hope the community will like it and get inspired to share more of their learning too!
51
142
1,069
63,983
Should we acquire Humane to open-source the pin?
284
23
1,022
294,000
Requiring a license to train models would be like requiring a license to write code. IMO, it would further concentrate power in the hands of a few & drastically slow down progress, fairness & transparency.
50
188
1,010
199,264
I've said it and will say it again: #1 Smaller, cheaper, faster, more customized models will cover 99% of use-cases. You don't need a million dollar formula 1 to get to work everyday and you don't need a banking customer success chatbot to tell you the meaning of life! #2 All companies will ultimately want to build AI themselves (based on open-source AI) versus outsource to third-party APIs and there will be as many models as code repos today. AI is a foundational technology to build tech and the same way you don't outsource or have the same code-base as your competitors, you won't want to outsource your AI development or use the same models as your competitors.
What's going on here? Microsoft wants to build its own advanced AI to reduce costs and dependence on OpenAI. What does this mean? Microsoft is directing its researchers to create conversational AI models that perform nearly as well as OpenAI's but are much smaller and cheaper to run. This in-house AI is already being tested by Microsoft's Bing team for features similar to ChatGPT. The goal is to save on the ballooning compute costs of large AI models. Even though Microsoft has invested over $10 billion in OpenAI for exclusive access to its tech, unchecked costs from widespread use could get out of hand fast. Smaller "distilled" models that mimic the capabilities of behemoths like GPT-4 can help control expenses while providing customers with powerful AI features. Why should I care? This gives Microsoft options beyond OpenAI to deliver performant and affordable AI products. It's a smart business move to avoid overreliance on external partners, however fruitful the relationship may be currently. Plus having in-house alternatives puts Microsoft in a better negotiating position with OpenAI down the road.
38
167
1,046
318,354
Baidu just released 23 models at the same time on @huggingface - from 0.3B to 424B parameters. Let’s go!
36
140
1,050
165,966
Every day, over 1,500 terabytes of open models and datasets are downloaded and uploaded between @huggingface and @googlecloud by millions of AI builders. We suspect it generates over a billion dollars of cloud spend annually already. So we’re excited to announce today a new partnership to make it faster, safer & cheaper for Google Cloud customers to work with Hugging Face. With them, we will: - reduce Hugging Face model & dataset upload and download times through Vertex AI and Google Kubernetes Engine thanks to a new gateway for Hugging Face repositories that will cache directly on Google Cloud - offer native support for TPUs on all open models sourced through Hugging Face - provide a safer experience through Google Cloud’s built-in security capabilities. Ultimately, our intuition is that the majority of cloud spend will be AI related and based on open-source (rather than proprietary APIs) as all technology builders will become AI builders. And both Google Cloud and Hugging Face will be there for it, let's go!
25
93
1,067
118,726
Hugging Face was founded by immigrants
Many of the companies that are founded by immigrants will be the next set of Fortune 500 companies that will create millions of jobs in the future. And many of the founders of these companies once came here working at another company. This is America’s secret weapon.
44
46
960
87,862
If you're a researcher or engineer releasing open science papers & open models and datasets, I bow to you 🙇🙇🙇 From what I'm hearing, doing so, especially in US big tech, often means fighting your manager and colleagues, going through countless legal meetings, threatening to quit or taking a lower paycheck, and sometimes the result is only that you'll get scolded when what you shared is used by competitors. But, please remember: research papers and open models and datasets is how progress happens! Your efforts are pushing AI toward a more open and collaborative future. Thanks to openness, your research or models get a chance to be noticed, seen & built upon by people you respect to accelerate progress, grow your network & accelerate your impact. It might be tough right now but open science will ultimately prevail as it always did! The researchers & engineers that we'll remember in ten years are the ones who share what they build, not the ones that keep it behind closed-doors for company profit maximization. Please keep fighting for openness. We see you and we thank you! 💚💛 💙💜
41
115
1,022
102,670
1,000,000 🤗
60
17
1,007
74,962
Reminder that you can't trust one company (or a few) to build safe AGI. The safest path is open, transparent and decentralized!
56
122
956
126,171
AI is not a zero-sum game. Open-source AI is the tide that lifts all boats!
77
95
961
57,994
The more I grow, the more I believe the #1 mission of a startup ceo is to fight the natural destructive tendencies of growing orgs like processes, meetings, verticalization and centralization, politics and in general everything that slows down your ability to keep increasing your impact.
47
70
949
290,848
Six predictions for AI in 2025 (and a review of how my 2024 predictions turned out): - There will be the first major public protest related to AI - A big company will see its market cap divided by two or more because of AI - At least 100,000 personal AI robots will be pre-ordered - China will start to lead the AI race (as a consequence of leading the open-source AI race). - There will be big breakthroughs in AI for biology and chemistry. - We will begin to see the economic and employment growth potential of AI, with 15M AI builders on Hugging Face. How my predictions for 2024 turned out: - A hyped AI company will go bankrupt or get acquired for a ridiculously low price ✅ (Inflexion, AdeptAI,...) - Open-source LLMs will reach the level of the best closed-source LLMs ✅ with QwQ and dozens of others - Big breakthroughs in AI for video, time-series, biology and chemistry ✅ for video 🔴for time-series, biology and chemistry - We will talk much more about the cost (monetary and environmental) of AI ✅Monetary 🔴Environmental (😢) - A popular media will be mostly AI-generated ✅  with NotebookLM by Google - 10 millions AI builders on Hugging Face leading to no increase of unemployment 🔜currently 7M of AI builders on Hugging Face
89
183
951
148,956
Love how @Apple is advocating for on-device AI at WWDC . Local, smaller, specialized models are the future of private, secure and efficient AI.
36
95
914
85,409
Could we stop comparing raw models (llama, stable diffusion,...) with APIs (gpt4, claude,...). Most APIs probably include a lot of engineering tricks and even several models under the hood chained or MOEd together so these are not the same things at all and can't be compared.
36
109
921
156,180
let's open superintelligence for all!
91
54
944
78,255
We crossed 1M models on Hugging Face!
38
70
919
79,792
With the Google announcement last week, I think we're now officially the only AI startup out there who has commercial collaborations with all the major cloud providers (AWS, GCP, Azure) and hardware providers (Nvidia, AMD, Intel, Qualcomm,...), making our vision of being the independent and agnostic platform for all AI builders truer than ever! Let's go!
26
83
906
176,607
Maybe we should buy Cline 😅😅😅
🤗🤗🤗 🤗❤️🤗 @huggingface & Cline = your LLM playground 🤗🤗🤗 You can access Kimi K2 & 6,140 (!) other open source models in Cline.
66
43
917
120,114
I've said it and will say it again. Concentration of power is the biggest risk in AI!
52
117
886
141,484
We were chilling until @levelsio triggered our monetization anxiety. New premium team plan now live 😅😅😅
How in the frick does @HuggingFace make money? They host literally everyone's AI models They don't ask for an API key when using their API for free They don't charge me when I run a model either ????????
40
26
930
202,029
We collaborated with the European Space Agency to open-source the largest ever earth observation dataset: Major TOM Core! About half of the entire planet is covered. That's 2,245,886 patches of 1068 x 1068 pixels. At 10m resolution, we've got 256 million square km with over 2.5 trillion pixels. More datasets from different satellites are in preparation and anyone can join this collaborative effort thanks to their organization on Hugging Face: huggingface.co/Major-TOM. Quoting @mikonvergence @esa: “democratizing Earth Observation model development and deployment with transparent, reproducible, and traceable tools - starts with the data!” You can explore the data here huggingface.co/spaces/Major-… & access the dataset here: huggingface.co/datasets/Majo…
23
213
854
129,760
We thought we would get xAI open-source but got zAI so even better 😅😅😅
41
43
884
99,741
To me, huge ML models are to machine learning what formula 1 is to the car industry!
24
66
853
The GPT4 of datasets took down Hugging Face, sorry all 😅😅😅
22
38
850
276,456
8 years later, we finally got the .com domain of @huggingface! If anything, maybe an example not to sweat the small details too early & trust your guts as so many people told us this was a shitty name!
63
41
853
67,629
Pumped to announce the brand new open LLM leaderboard. We burned 300 H100 to re-run new evaluations like MMLU-pro for all major open LLMs! Some learning: - Qwen 72B is the king and Chinese open models are dominating overall - Previous evaluations have become too easy for recent models, much like grading high school students on middle school problems - There are indications that AI builders have started to focus on the main evaluations too much at the expense of model performances on other ones - Bigger is not always smarter Really excited about how the field is maturing on evaluation, let's go!
41
124
847
270,424
Google is back baby! Taking the first spot for open models on the @huggingface LLM leaderboard for its sizes (2B & 7B): huggingface.co/collections/g…
24
115
762
105,971