Yannic Kilcher 🇸🇨 · May 18, 2021 · 2:52 PM UTC

Pinned Tweet

Yannic Kilcher 🇸🇨

@ykilcher

18 May 2021

🥳Special Video🥳This has been in the works for a while. I used CLIP + BigGAN to make a music video for a song with lyrics made from ImageNet class labels🤠"Be my weasel", performed by me on a looper🎸Code & references available, make your own! Enjoy🤟 piped.video/rR5_emVeyBk

107

709

Yannic Kilcher 🇸🇨 · Mar 14, 2023 · 6:01 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

14 Mar 2023

GPT-4 paper literally is just saying "we trained a model on data and it's better". Spread over 98 pages.

236

3,003

370,112

Yannic Kilcher 🇸🇨 · Apr 15, 2023 · 5:00 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

15 Apr 2023

🔥EVERYONE🔥We’re excited to announce the release of OpenAssistant. The future of AI development depends heavily on high quality datasets and models being made publicly available, and that’s exactly what this project does. Watch the announcement video: piped.video/ddG2fM9i4Kk

ALT OpenAssistant Conversational AI for everyone

468

2,001

928,705

Yannic Kilcher 🇸🇨 · Nov 20, 2021 · 3:46 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

20 Nov 2021

"neural networks focus too much on texture"

Peyman Milanfar

@docmilanfar

18 Nov 2021

The prior in your brain is wrong. This isn’t fried chicken

154

1,277

Yannic Kilcher 🇸🇨 · May 17, 2023 · 4:31 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

17 May 2023

Asking for more regulation is a classic move of market leaders to suppress all competition. How petty of OpenAI to sink to this level.

The New York Times

@nytimes

16 May 2023

Sam Altman, CEO of the start-up behind the AI chatbot ChatGPT, agreed with members of the Senate on Tuesday on the need to regulate increasingly powerful AI technology. nyti.ms/435J9Qt

158

1,275

211,294

Yannic Kilcher 🇸🇨 · Aug 20, 2023 · 1:12 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

20 Aug 2023

So the M stands for...?

152

1,060

276,921

Yannic Kilcher 🇸🇨 · Nov 20, 2023 · 11:26 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

20 Nov 2023

Ok hear me out, each person who leaves OpenAI just has to memorize a billion weights then we get GPT-4

1,010

99,623

Yannic Kilcher 🇸🇨 · Dec 24, 2024 · 10:41 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

24 Dec 2024

🔥New Video🔥 I delve (ha!) into Byte Latent Transformer: Patches Scale Better Than Tokens where the authors do away with tokenization and create an LLM architecture that operates on dynamically sized "patches" instead of tokens. By controlling the patch size, they gain a level of control over the tradeoff between model size and FLOPs and use that to achieve more favorable scaling behavior than classically tokenized LLMs. Watch here: piped.video/loaTGpqfctI

101

1,013

83,809

Yannic Kilcher 🇸🇨 · May 21, 2021 · 10:18 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

21 May 2021

There are 4 big Machine Learning conferences now: NeurIPS, ICML, ICLR, and Google I/O

115

958

Yannic Kilcher 🇸🇨 · Dec 14, 2024 · 8:42 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

14 Dec 2024

This is petty

OpenAI

@OpenAI

13 Dec 2024

Elon Musk wanted an OpenAI for-profit. openai.com/index/elon-musk-w…

971

182,073

Yannic Kilcher 🇸🇨 · May 1, 2021 · 12:41 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

1 May 2021

it's a doozy

115

882

Yannic Kilcher 🇸🇨 · May 1, 2023 · 10:29 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

1 May 2023

We must urgently stop all further development on this new "keyboard" technology. In the near future, anyone will just be able to type anything!!! The world will be flooded with fake news and civilization will fall😱

829

277,073

Yannic Kilcher 🇸🇨 · Oct 4, 2020 · 11:36 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

4 Oct 2020

🔥New Video🔥Convolutions are DEAD as Transformers continue to ruin absolutely everything 😱 New SotA on ImageNet, VTAB, etc using only Transformers + massive data 👑 Also Peer Review is broken. Watch Now!👀 piped.video/TrdevFK_am4 @GoogleAI @giffmana @__kolesnikov__ @XiaohuaZhai

179

827

Yannic Kilcher 🇸🇨 · Mar 6, 2024 · 7:58 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

6 Mar 2024

NEW & BREAKING: A Sharpie engineer has spent months testing its "pen" product. He's disturbed by the violent/sexual content it can create & Walmart's decision not to take it off the shelves to investigate.

(((ل()(ل() 'yoav))))👾

@yoavgo

6 Mar 2024

NEW & BREAKING: An Adobe engineer has spent months testing its image-generator software, Photoshop. He's disturbed by the violent/sexual content it can create & Adobe's decision not to take it offline to investigate.

703

148,082

Yannic Kilcher 🇸🇨 · Jun 25, 2020 · 2:19 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

25 Jun 2020

New Video 🔥 Deep Learning is very good at fitting functions numerically, but what about deriving symbolic expressions? How Graph Networks can learn Newtonian Physics and Dark Matter! piped.video/LMb5tvW-UoQ @MilesCranmer @PeterWBattaglia @KyleCranmer @DavidSpergel @cosmo_shirley

155

659

Yannic Kilcher 🇸🇨 · Dec 24, 2023 · 3:56 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

24 Dec 2023

🔥New Video🔥 "Isn't Mamba just a fancy LSTM?" - turns out, there are some key differences! This video is a close look at selective state spaces: piped.video/9dSkvxS2EB0

645

49,469

Yannic Kilcher 🇸🇨 · Oct 6, 2021 · 9:26 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

6 Oct 2021

"Grokking" is weird: Neural Networks trained to fill in binary operation tables will quickly overfit to the training data, but after many, many steps suddenly "get it" and achieve 100% validation accuracy. piped.video/dND-7llwrpw

112

621

Yannic Kilcher 🇸🇨 · Sep 18, 2022 · 8:45 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

18 Sep 2022

Stable Diffusion had a good run. It was the cool new kid on the block. Sadly, it's now in TensorFlow. Have fun with it boomers...

Divam Gupta

@divamgupta

17 Sep 2022

Stable Diffusion implemented using @Tensorflow and #Keras.
- Converted pre-trained models
- Easy to understand code
- Minimal code footprint
Code: github.com/divamgupta/stable…
Google Colab with @Gradio demo: colab.research.google.com/dr…

622

Yannic Kilcher 🇸🇨 · Jun 9, 2020 · 2:10 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

9 Jun 2020

This model learns, unsupervised, to translate code from Python to C++, including standard library calls and type inference! 👀 Watch this video to find out how! piped.video/xTzFJIknh7E @MaLachaux @b_roziere @LowikChanussot @GuillaumeLample @facebookai

144

572

Yannic Kilcher 🇸🇨 · Jul 29, 2023 · 12:44 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

29 Jul 2023

Another sad day for open source. I personally wrote the first version of token-streaming for this.

Jeff Boudier 🤗

@jeffboudier

28 Jul 2023

Today is a huge milestone for one of our latest libraries: Text Generation Inference - we released v1.0 and under a new license: HFOIL 1.0 github.com/huggingface/text-… This 🧵 explains what this new license means, and why the change!

565

244,946

Yannic Kilcher 🇸🇨 · Aug 6, 2025 · 3:01 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

6 Aug 2025

To the contrary, people don't forget GPT-2. People vividly remember that, quite unprecedented, OpenAI refused to share code or weights for GPT-2 and single-handedly started an era of closed models and commercialism over science.

clem 🤗

@ClementDelangue

5 Aug 2025

And just like that, @OpenAI gpt-oss is now the number one trending model on @huggingface, out of almost 2M open models 🚀 People sometimes forget that they've already transformed the field: GPT-2, released back in 2019 is HF's most downloaded text-generation model ever, and Whisper has consistently ranked in the top 5 audio models. Now that they are doubling down on openness, they may completely transform the AI ecosystem, again. Exciting times ahead!

593

49,680

Yannic Kilcher 🇸🇨 · Jul 3, 2023 · 8:55 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

3 Jul 2023

Men who use LARGE language models, is it possible that you're compensating for something?

553

59,707

Yannic Kilcher 🇸🇨 · Jun 3, 2022 · 3:50 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

3 Jun 2022

This is the worst AI ever! I trained a language model on 4chan's /pol/ board and the result is.... more truthful than GPT-3?! See how my bot anonymously posted over 30k posts on 4chan and try it yourself. Watch here (warning: may be offensive): piped.video/efPrtcLdcdM

547

Yannic Kilcher 🇸🇨 · Nov 22, 2023 · 11:49 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

22 Nov 2023

The AI ethics community is dead. It has no more power. This is good because it was never about ethics. Next are the effective altruists, most of which are neither effective nor altruistic. Sanity will win

551

56,044

Yannic Kilcher 🇸🇨 · Jun 18, 2022 · 9:03 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

18 Jun 2022

Apparently, Stanford is putting together a strongly worded letter against me. I'm not kidding. A strongly worded letter.

549

Yannic Kilcher 🇸🇨 · Apr 8, 2024 · 3:35 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

8 Apr 2024

🔥New Video🔥 Flow matching (not classic diffusion) is the basis for state-of-the-art text to image models, like Stable Diffusion 3. Here is how it works: piped.video/7NNxK3CqaDk

536

40,365

Yannic Kilcher 🇸🇨 · Apr 30, 2022 · 1:00 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

30 Apr 2022

JavaScript be like
"==" the same
"===" really the same
"====" really, actually the same
"=====" you won't even believe how the same those things are

512

Yannic Kilcher 🇸🇨 · Apr 14, 2023 · 8:39 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

14 Apr 2023

Programming is now just arguing with models.

509

49,085

Yannic Kilcher 🇸🇨 · Jan 8, 2022 · 10:43 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

8 Jan 2022

Conclusion: If we make the car bigger, it will probably work.

Francesco Orabona

@bremen79

7 Jan 2022

When you debug a machine learning model

511

Yannic Kilcher 🇸🇨 · May 29, 2020 · 3:23 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

29 May 2020

GPT-3 is out and it is HUGE 🤯 Turns out that a pure Language Model can zero-shot almost any NLP Task! Here's my video summary of this 175 BILLION parameter beast! piped.video/SY5PvZrJhLE @nottombrown @8enmann @AlecRad @Dario_Amodei @arvind_io @girishsastry @AmandaAskell @ilyasut

141

521

Yannic Kilcher 🇸🇨 · Apr 12, 2024 · 5:22 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

12 Apr 2024

NeurIPS introduces a track dedicated to advancing kids of rich parents even more than they already are

Gautam Kamath

@thegautamkamath

12 Apr 2024

NeurIPS 2024 will have a track for papers from high schoolers.

509

93,885

Yannic Kilcher 🇸🇨 · Oct 31, 2023 · 7:38 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

31 Oct 2023

I have a secret for you... #manufacturedoutrage

Nikki Teran

@DrNikkiTeran

30 Oct 2023

Will releasing the weights of large language models grant widespread access to pandemic agents? Turns out, yes, probably. 1/5

494

81,328

Yannic Kilcher 🇸🇨 · Feb 11, 2024 · 11:53 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

11 Feb 2024

According to the current AI landscape, Microsoft Word is Open Source, because I can use it for free as a student.

462

56,095

Yannic Kilcher 🇸🇨 · Jun 2, 2023 · 10:41 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

2 Jun 2023

🔥New Video🔥 RWKV takes the best of both worlds: Transformers and RNNs and combines them into a scalable architecture that is refreshingly different. This video dives deep into how it works and where its tradeoffs are: piped.video/x8pW19wKfXQ

471

61,572

Yannic Kilcher 🇸🇨 · Apr 12, 2024 · 8:48 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

12 Apr 2024

To all "it's merit based" responders: if you reward skills before they are introduced in the public school system, the vast majority of rewardees will come from extremely privileged backgrounds that support and incentivize them to acquire those skills privately.

Yannic Kilcher 🇸🇨

@ykilcher

12 Apr 2024

NeurIPS introduces a track dedicated to advancing kids of rich parents even more than they already are

467

45,860

Yannic Kilcher 🇸🇨 · May 24, 2023 · 2:47 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

24 May 2023

Shocking: A trained model beats an untrained model. It's 2023 everyone 😁

@_akhaliq

24 May 2023

Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks

introduce Goat, a fine-tuned LLaMA model that significantly outperforms GPT-4 on a range of arithmetic tasks. Fine-tuned on a synthetically generated dataset, Goat achieves state-of-the-art performance on BIG-bench arithmetic sub-task. In particular, the zero-shot Goat-7B matches or even surpasses the accuracy achieved by the few-shot PaLM-540B. Surprisingly, Goat can achieve near-perfect accuracy on large-number addition and subtraction through supervised fine-tuning only, which is almost impossible with previous pretrained language models, such as Bloom, OPT, GPT-NeoX, etc

paper page: huggingface.co/papers/2305.1…

461

85,757

Yannic Kilcher 🇸🇨 · Feb 4, 2023 · 2:12 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

4 Feb 2023

It's surprisingly fun to collect data for OpenAssistant - Our open-source alternative to ChatGPT! Check out the video: piped.video/64Izfm24FKA #openassistant #chatgpt

462

86,417

Yannic Kilcher 🇸🇨 · Aug 13, 2022 · 11:23 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

13 Aug 2022

Who is behind #StableDiffusion? Check out this interview with @EMostaque , Founder of Stability AI. We chat about open sourcing models, building a giant compute cluster from scratch, and how he envisions a true democratization of AI. piped.video/YQ2QtKcK2dA @StableDiffusion

ALT stability.ai AI by the people, for the people.

448

Yannic Kilcher 🇸🇨 · Jun 7, 2022 · 5:09 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

7 Jun 2022

AI Ethics people just mad I Rick rolled them.

418

Yannic Kilcher 🇸🇨 · May 6, 2021 · 3:22 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

6 May 2021

🔥Short Video🔥MLP-Mixer by @GoogleAI already has about 20 GitHub implementations in less than a day. An only-MLP network reaching competitive ImageNet- and Transfer-Performance due to smart weight sharing! Check it! piped.video/7K4Z8RqjWIk @neilhoulsby @giffmana @__kolesnikov__

437

Yannic Kilcher 🇸🇨 · Feb 1, 2021 · 7:57 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

1 Feb 2021

guys...😂

430

Yannic Kilcher 🇸🇨 · Oct 3, 2020 · 4:16 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

3 Oct 2020

New Video 🥳 This paper uses lots of compute to learn a single, unified LSTM-based optimizer on over 6000(!) different tasks, then uses that optimizer to TRAIN ITSELF! We're in full meta-land 😱 piped.video/3baFTP0uYOc @GoogleAI @Luke_Metz @niru_m @bucketofkets @poolio @jaschasd

410

Yannic Kilcher 🇸🇨 · Dec 2, 2021 · 10:38 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

2 Dec 2021

Oh yes, "neural networks", that one algorithm 😁

Santiago

@svpino

2 Dec 2021

There are thousands of machine learning algorithms out there, but you'll rarely need more than a handful. A good start:
• Linear/Logistic Regression
• Decision Trees
• Neural Networks
• XGBoost
• Naive Bayes
• PCA
• KNN
• Random Forests
• K-Means

403

Yannic Kilcher 🇸🇨 · Jul 3, 2021 · 1:03 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

3 Jul 2021

🔥New Video🔥an analysis of @karpathy's talk about Tesla's full self-driving system, using NOTHING BUT VISION🤯 Major themes: Auto-labelling to collect data, careful detection of edge-cases, and the massive benefits of owning the entire pipeline💪 piped.video/9MJTeOaSMTk

ALT Full self-driving Vision Only Tesla

415

Yannic Kilcher 🇸🇨 · Nov 26, 2023 · 9:30 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

26 Nov 2023

A strategy that never fails: Reel them in with the hype, then, when they least expect it, educate them! piped.video/nOBm4aYEYR4

408

45,150

Yannic Kilcher 🇸🇨 · Jun 5, 2021 · 5:34 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

5 Jun 2021

🔥New Video🔥Decision Transformers gets remarkably good performance on Offline RL by just ditching everything RL and using sequence modeling🤯Check it piped.video/-buULmf7dec @lchen915 @_kevinlu @aravindr93 @kimin_le2 @adityagrover_ @MishaLaskin @pabbeel @AravSrinivas @IMordatch

397

Yannic Kilcher 🇸🇨 · Oct 17, 2020 · 1:32 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

17 Oct 2020

🔥New Video🔥 LambdaNetworks capture long-range interactions as linear functionals🤯 Super complicated, basically Transformers without the giant memory requirements 🥳 New SotA on ImageNet! 💪 Watch Now! piped.video/3qxJ2WD8p4w #ICLR2021

405

Yannic Kilcher 🇸🇨 · Oct 7, 2021 · 10:29 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

7 Oct 2021

How do Machine Learners diet? They turn on weight decay.

396

Yannic Kilcher 🇸🇨 · Jun 24, 2020 · 1:50 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

24 Jun 2020

New Video 🔥 How I Read A Machine Learning Paper Here's my process of reading and understanding the DETR object detection paper in an efficient manner. piped.video/Uumd2zOOz60

398

Yannic Kilcher 🇸🇨 · Apr 14, 2021 · 1:42 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

14 Apr 2021

🔥Special Video🔥I built a Neural Network in Minecraft 🎲No Mods, No Command Blocks 📶Analog, not Digital ⛏️Backpropagation & Weight Updates ⚙️Fully Automatic 🧑‍💻Open Source This video details what it does, how it works, and how it's built. Don't miss it 😉 piped.video/7OdhtAiPfWY

ALT Neural Network in Minecraft

385

Yannic Kilcher 🇸🇨 · Dec 16, 2020 · 1:42 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

16 Dec 2020

Replying to @lexfridman

Continued by GPT-3: "2. Those who cannot In the first case, the person is a scientist. In the second case, the person is a journalist."

391

Yannic Kilcher 🇸🇨 · Sep 25, 2021 · 4:09 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

25 Sep 2021

I am
[x] pro vaccine
[x] anti excessive government pressure
yet when I protest the latter, I'm immediately lumped in with the antivax crowd. My opinion is not registered anywhere because people like me just don't speak up, and I feel I'm not the only one. Who else feels this way?

387

Yannic Kilcher 🇸🇨 · May 1, 2021 · 7:57 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

1 May 2021

Join me for this video👯We take a look at @facebookai's DINO architecture, pushing Self-Supervised Learning for Vision Transformers to truly impressive levels🔥🔥🔥 Check it out! piped.video/h3ij3F3cPIk @julienmairal @armandjoulin

ALT DINO: Emerging Properties in Self-Supervised Vision Transformers

390

Yannic Kilcher 🇸🇨 · Jun 6, 2022 · 9:01 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

6 Jun 2022

I asked this person twice already for an actual, concrete instance of "harm" caused by gpt-4chan, or even a likely one that couldn't be done by e.g. gpt-2 or gpt-j (or a regex for that matter), but I'm being elegantly ignored 🙃

Lauren Oakden-Rayner 🏳️‍⚧️

@DrLaurenOR

6 Jun 2022

This week an #AI model was released on @huggingface that produces harmful + discriminatory text and has already posted over 30k vile comments online (says its author). This experiment would never pass a human research #ethics board. Here are my recommendations. 1/7

I agree with KCramer. There is nothing wrong with making a 4chan-based model and testing how it behaves.

The main concern I have is that this model is freely accessible for use. While open science is a great principle, I'm a medical doctor and safety researcher by training and we always need to consider possible harms. Human research ethics is baked into the very foundation of our field, because of a long history of human rights abuses in the name of science, in particular experiments that cause harm to disempowered and marginalised people without their consent.

It should be clear that this model carries a significant risk for this sort of harm, given the fact such an experiment has already been performed. The model author has used this model to produce a bot that made tens of thousands of harmful and discriminatory online comments on a publicly accessible forum, a forum that tends to be heavily populated by teenagers no less. There is no question that such human experimentation woul

Text from huggingface discussion: Given the demonstrated risk of harm, this model should not be freely accessible. The medical community has well established guidelines on how to manage the sharing of research materials which involve a risk to human subjects, with data privacy being the most common risk. It is common to allow research access to datasets in this context via a registration platform, where the applicants who are seeking access must describe their proposed research, and sign an agreement for data use. See the NIH/TCIA and MIMIC datasets for examples. The latter even has a requirement for applicants to pass a course in human research ethics prior to obtaining access to the data.

A similar system should be in place here, and be used as the template for future model sharing where the model has the potential to produce harm.

359

Yannic Kilcher 🇸🇨 · Feb 7, 2023 · 6:02 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

7 Feb 2023

Google has a weird definition of "shared".

Sundar Pichai

@sundarpichai

6 Feb 2023

1/ In 2021, we shared next-gen language + conversation capabilities powered by our Language Model for Dialogue Applications (LaMDA). Coming soon: Bard, a new experimental conversational #GoogleAI service powered by LaMDA. blog.google/technology/ai/ba…

377

76,225

Yannic Kilcher 🇸🇨 · Feb 14, 2021 · 4:59 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

14 Feb 2021

New👏Video👏NFNets achieve new ImageNet SotA by DROPPING batchnorm😱They train 9 times faster than EfficientNet and excel at transfer learning🔥Code is available, too💪Watch now & don't miss some spicy comments from me😄 piped.video/rNkHjZtH0RQ @ajmooch @sohamde_ @SamuelMLSmith

391

Yannic Kilcher 🇸🇨 · Aug 9, 2020 · 3:51 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

9 Aug 2020

New Video 🥳 Modern Hopfield Networks can store & retrieve exponentially many patterns and have a surprising and intricate connection to Transformer Attention Mechanism! 🔥 piped.video/nv6oFDp6rNQ @HRamses2 @MichaelWidrich @milenapavl @SandveGeir @victorgreiff @jbrandi6 @LITAILab

377

Yannic Kilcher 🇸🇨 · Feb 20, 2024 · 10:38 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

20 Feb 2024

Yesterday I released a video going over V-JEPA, how it works, and why it matters (including a recap of the original JEPA). Watch here: piped.video/7UkJPwz_N_0

376

45,138

Yannic Kilcher 🇸🇨 · Jun 21, 2022 · 6:11 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

21 Jun 2022

Replying to @percyliang

Could you please at least link the video in the letter so people can make up their own mind?

346

Yannic Kilcher 🇸🇨 · Aug 29, 2021 · 9:42 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

29 Aug 2021

It's 2025. MLP-Supermixer-200T outperforms every human at every task. ... "bUt DoEs It ReAlLy UnDeRsTaNd AnYtHiNg?"

360

Yannic Kilcher 🇸🇨 · Jun 1, 2024 · 10:24 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

1 Jun 2024

I've made a video explaining xLSTM. Watch here: piped.video/0OaEv1a5jUM

362

32,685

Yannic Kilcher 🇸🇨 · Feb 27, 2021 · 3:54 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

27 Feb 2021

🔥New Video🔥GLOM is @geoffreyhinton's new Computer Vision idea🥳The model represents part-whole hierarchies into implicit parse trees via a multi-step attention-based consensus algorithm👀Excited? Me too! Watch the video to find out more!👇 piped.video/cllFzkvrYmE @GoogleAI

357

Yannic Kilcher 🇸🇨 · Dec 1, 2020 · 3:31 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

1 Dec 2020

🔥New Video🔥 @DeepMind AlphaFold 2 delivers major AI breakthrough in Protein Folding🧬Beats all competition by HUGE margins🤯Watch to learn how AlphaFold 1 works and what we can guess about AlphaFold 2💪 (Hint: Transformers 😉) piped.video/B9PL__gVxLI @demishassabis #AlphaFold2

350

Yannic Kilcher 🇸🇨 · Nov 27, 2021 · 7:17 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

27 Nov 2021

🔥New Video🔥How to backpropagate through an algorithm? Seems crazy, but this paper shows it's actually possible for a large class of algorithms, such as k-subset, ILP, and many graph algorithms. Watch my (amateur 🙃) attempt at an explanation here: piped.video/W2UT8NjUqrk

ALT Implicit-MLE Backpropagation Through Algorithms

347

Yannic Kilcher 🇸🇨 · Jan 5, 2023 · 9:22 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

5 Jan 2023

Give us the models? 🤷‍♀️

Logan Kilpatrick

@OfficialLoganK

5 Jan 2023

If you are a developer using the @OpenAI API, DALL-E, ChatGPT, etc. what can we do to make the developer experience better? 🧵👇

340

41,250

Yannic Kilcher 🇸🇨 · Sep 2, 2022 · 8:23 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

2 Sep 2022

Turns out loading models from the hub (or any other place) is ⚠️ NOT SAFE ⚠️ and opens you up to arbitrary code execution by an attacker🤯 Learn how to do it yourself (and how to protect against it) in this video: piped.video/2ethDz9KnLk

339

Yannic Kilcher 🇸🇨 · Jul 4, 2020 · 12:46 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

4 Jul 2020

New Video 🔥 No more O(N^2) complexity in Transformers: Kernels to the rescue! 🥳 This paper makes Attention linear AND shows an intriguing connection between Transformers and RNNs 💪 piped.video/hAooAOFRsYc @angeloskath @apoorv2904 @nik0spapp @francoisfleuret @EPFL_en @Idiap_ch

342

Yannic Kilcher 🇸🇨 · Oct 7, 2022 · 10:53 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

7 Oct 2022

🔥New Video🔥This almost seems like magic🪄DeepMind's AlphaTensor finds new algorithms for doing matrix multiplication that use fewer multiplication operations(!) than any algorithm humans have discovered so far. Watch here to see how they do it 👇 piped.video/3N3Bl5AA5QU

337

Yannic Kilcher 🇸🇨 · Sep 17, 2022 · 2:52 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

17 Sep 2022

How to make your CPU as fast as a GPU? 🔥 Nir Shavit explains how clever algorithms can make use of sparsity in neural networks to deliver unprecedented inference speed, without any need for specialized hardware! Watch here: piped.video/0PAiQ1jTN5k

329

Yannic Kilcher 🇸🇨 · May 10, 2021 · 7:59 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

10 May 2021

One month from now: SotA on ImageNet by really large logistic regression on patches.

324

Yannic Kilcher 🇸🇨 · May 10, 2023 · 9:02 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

10 May 2023

For the common good, download and backup this model. huggingface.co/ehartford/Wiz…

QuixiAI/WizardLM-7B-Uncensored · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

324

89,544

Yannic Kilcher 🇸🇨 · Jul 5, 2020 · 3:36 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

5 Jul 2020

Computer Vision just got an Upgrade 🔥 SpineNet is a smaller, better and faster replacement to ResNet by @GoogleAI obtained using Neural Architecture Search 💪 Watch the Video 👀 piped.video/qFRfnIRMNlk @Phyyysalis @tanmingxing @YinCui1 @quocleix Thumbnail Art by Lucas Ferreira!

328

Yannic Kilcher 🇸🇨 · Apr 10, 2022 · 10:13 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

10 Apr 2022

🎉ML News: Generative MEGA-Models🎉
- Google PaLM: Amazing 540B Transformer
- OpenAI DALL-E 2: Text-to-Image breakthrough
- Open CLIP, open VQGAN diffusion, open datasets
- Salesforce CodeGen
- ...and the surprises one finds in Zurich 😉
piped.video/RJwPN4qNi_Y

ALT Generative Mega-Models OpenAI DALL-E 2 | Google PaLM

323

Yannic Kilcher 🇸🇨 · Nov 19, 2022 · 12:38 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

19 Nov 2022

Meta, you did almost everything right. Now grow a pair and keep that demo up.

306

Yannic Kilcher 🇸🇨 · Jan 13, 2024 · 4:15 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

13 Jan 2024

New Video on the recently released Mixtral of Experts paper. We look into sparse mixture of experts routing, and note the distinct absence of any mention whatsoever where the training data came from. Watch here: piped.video/mwO6v4BlgZQ

319

29,314

Yannic Kilcher 🇸🇨 · Jan 15, 2024 · 6:34 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

15 Jan 2024

People who advocate for "safe" LLMs sometimes don't consider what this word means to other people

Jack Clark

@jackclarkSF

13 Jan 2024

Just checking in on alignment of LLMs in China, it's going about how you'd expect.

305

35,681

Yannic Kilcher 🇸🇨 · Mar 30, 2021 · 2:17 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

30 Mar 2021

🥳Special Video🥳You've just started PhD have no clue what to do? Welcome to the club🙂A Survival Guide for PhDs in Machine Learning🧑‍🔬How to do topic selection, conferences, paper writing & what I learned from many mistakes👍Watch, Like, Share🔥Thank You piped.video/rHQPBqMULXo

ALT How to Machine Learning PhD Survival Guide

306

Yannic Kilcher 🇸🇨 · May 22, 2024 · 6:41 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

22 May 2024

Google search is now illegal

Deedy

@deedydas

22 May 2024

BREAKING: California’s newly passed AI bill requires models trained with over 10^26 flops to
- not be fine tunable to create chemical / biological weapons
- immediate shut down button
- significant paperwork and reporting to govt

302

50,819

Yannic Kilcher 🇸🇨 · Apr 6, 2023 · 11:16 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

6 Apr 2023

🔥Here we go🔥 The first OpenAssistant models are out! We have collected the most amazing human dataset ever and it shows: This model is really cool! Watch the video to see it in action and come give it a try: piped.video/Hi6cbeBY2oQ

306

68,393

Yannic Kilcher 🇸🇨 · Mar 22, 2021 · 5:36 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

22 Mar 2021

👉Paper Explained Video👈Today: @DeepMind's new Perceiver model solves Transformers' quadratic bottleneck by using cross-attention into a self-attentive RNN backbone🦴Can attend to 50k pixels at once!👀Watch Now! piped.video/P_xeshTnPZg @drew_jaegle @OriolVinyalsML @joaocarreira

306

Yannic Kilcher 🇸🇨 · Nov 2, 2020 · 2:02 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

2 Nov 2020

🎉 New Video 🎉 Knowledge Graphs are very expensive to make, they need human experts. Or do they? 🧐 What if we replaced them with BERT or GPT-2? 🤯 Turns out, works really well, all without training! 🥳 piped.video/NAJOZTNkhlI @ChenguangWang @dawnsongtweets @ShawLiu12 #AI #NLP

293

Yannic Kilcher 🇸🇨 · Dec 9, 2022 · 11:16 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

9 Dec 2022

Oh no! Now we will be flooded with fictional plays that are COMPLETELY MADE UP!!1!

Google DeepMind

@GoogleDeepMind

9 Dec 2022

Introducing Dramatron, a new tool for writers to co-write theatre and film scripts with a language model. 🎭 Dramatron can interactively co-create new stories complete with title, characters, location descriptions and dialogue. Try it yourself now: dpmd.ai/dramatron-github

294

Yannic Kilcher 🇸🇨 · Sep 18, 2022 · 5:31 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

18 Sep 2022

A joke. It's called a joke. Oh the things people can get offended by 🤦🏽‍♀️

Gus (🤖🧠+🐍+🥑🗣️)

@gusthema

18 Sep 2022

This is one reason why people are afraid of contributing to the community
- Divam did a great job! Spent their time creating something super cool and shared with everyone
- Just to have someone come and shi* on their head for no reason!
This is very sad! Don't be like that!

296

Yannic Kilcher 🇸🇨 · Sep 3, 2023 · 12:11 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

3 Sep 2023

🚀New Video🚀 ReST bootstraps its own extended dataset and trains on ever higher-quality subsets of it. Re-using generated data multiple times means an efficiency advantage with respect to Online RL techniques like PPO. Watch here: piped.video/V4dO2pyYGgs

301

39,285

Yannic Kilcher 🇸🇨 · Apr 15, 2023 · 5:00 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

15 Apr 2023

To make things even better, we are making this entire dataset free and accessible to all who wish to use it. Check it out today at huggingface.co/OpenAssistant ! 🎉

286

44,466

Yannic Kilcher 🇸🇨 · Jan 6, 2021 · 1:54 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

6 Jan 2021

🔥New Video🔥 EVERYBODY is talking about @OpenAI's new DALL·E model 👀 It takes any piece of text and turns it into an image, absolutely crazy 😱 Watch the video to learn more💪 piped.video/j4xgkjWlfL4 #DALLE @ilyasut @_jongwook_kim @MikhailPavlov5 @gabeeegoooh @scottgray76

292

Yannic Kilcher 🇸🇨 · Nov 22, 2020 · 1:50 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

22 Nov 2020

🔥New Video🔥 FFT Magic🪄Fourier Neural Operators speed up PDE solvers by orders(!) of magnitude 🤯 Trained once, solve entire PDE families for any discretization!🎉Watch to find out more⏭️ piped.video/IaS72aHrJKE @ZongyiLiCaltech @kazizzad @AnimaAnandkumar @Caltech #ai #science

291

Yannic Kilcher 🇸🇨 · Oct 26, 2020 · 4:57 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

26 Oct 2020

🔥New Video🔥 Linear Attention! Unbiased Estimator! Random Features! Orthogonal Features! Low Variance! Tight Bounds! Kernels! Backward Compatible! The PERFORMER has it all🤯 Watch!💪 piped.video/xJrKIPwVwGM @XingyouSong @kchorolab @andreea_gane @lukaszkaiser @dmdohan @CambridgeMLG

282

Yannic Kilcher 🇸🇨 · Mar 16, 2024 · 7:34 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

16 Mar 2024

There is already a Switzerland of AI. It's called Switzerland

Paul Graham

@paulg

15 Mar 2024

Brexit may yet turn out to have been a good idea, if it means the UK can be the Switzerland of AI.

290

50,449

Yannic Kilcher 🇸🇨 · Apr 12, 2024 · 8:49 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

12 Apr 2024

No son of a construction worker is just going to randomly start doing ML research if they never hear of it and don't get told that it could be important for their future career, no matter how intelligent the kid is

275

14,373

Yannic Kilcher 🇸🇨 · May 17, 2024 · 7:39 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

17 May 2024

Been there done that 😁

Elon Musk

@elonmusk

17 May 2024

Replying to @OpenAI

Bfd, Grok is partnering with 4Chan mfw

283

40,079

Yannic Kilcher 🇸🇨 · Oct 24, 2022 · 8:50 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

24 Oct 2022

How to make money with NFTs:
1. Buy an NFT
2. Use it as a reminder for the rest of your life to not make shitty decisions.

279

Yannic Kilcher 🇸🇨 · Apr 29, 2022 · 6:40 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

29 Apr 2022

my_opinions != your_opinions
my_opinions = !your_opinions
important difference

270

Yannic Kilcher 🇸🇨 · Dec 3, 2020 · 8:32 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

3 Dec 2020

Ok I get it, I'm not the favorite child 😁

Demis Hassabis

@demishassabis

3 Dec 2020

Replying to @lexfridman

Thanks Lex, great video!

271

Yannic Kilcher 🇸🇨 · Apr 27, 2023 · 9:57 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

27 Apr 2023

🌏New Video🌎 Scaling Transformers to 1 MILLION tokens and beyond. We'll take a look at what lies behind the Recurrent Memory Transformer and see whether it lives up to the hype. Watch here: piped.video/4Cclp6yPDuw

282

34,852

Yannic Kilcher 🇸🇨 · Feb 17, 2021 · 4:12 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

17 Feb 2021

🎉New Video🎉TransGAN is the first successful attempt at building GANs with NO convolutions🔥Generator and Discriminator are Transformers (of course)👀Watch now to find out what 3 tricks make it all work!🧙(#3 will surprise you ;)) piped.video/R5DiLFOMZrc @CodeTerminator

275

Yannic Kilcher 🇸🇨 · Mar 11, 2021 · 5:20 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

11 Mar 2021

👉Video Out Now👈Self-Supervised Learning: The Dark Matter of Intelligence by @ylecun, @imisra_, @facebookai: "We believe that SSL is one of the most promising ways to [...] approximate a form of common sense in AI systems."🔥Watch to learn more! piped.video/Ag1bw8MfHGQ

272

Yannic Kilcher 🇸🇨 · Aug 28, 2020 · 1:06 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

28 Aug 2020

New Video 🥳 Transformers are coming for Images 😱 Axial-DeepLab combines learned Positional Embeddings w/ Axial Attention and gets SotA on Segmentation with a fully Attentional model! No Convolutions 🐐 piped.video/hv3UO3G0Ofo @YuilleAlan @imadamtm @JohnsHopkins @GoogleAI

273

Yannic Kilcher 🇸🇨 · Mar 16, 2021 · 6:21 PM UTC

Yannic Kilcher 🇸🇨

@ykilcher

16 Mar 2021

🔥New Video🔥Do Transformers learn universal computation primitives? GPT-2 pre-trained on language can transfer to vision while COMPLETELY FREEZING all attention weights🤯Only .1% of parameters tuned👀 piped.video/Elxn8rS88bI @_kevinlu @adityagrover_ @pabbeel @IMordatch

269

Yannic Kilcher 🇸🇨 · Apr 1, 2022 · 9:29 AM UTC

Yannic Kilcher 🇸🇨

@ykilcher

1 Apr 2022

YouTube's format just doesn't lend itself to educational long-form content anymore. I will henceforth do my paper reviews on TikTok.

267