🥳Special Video🥳This has been in the works for a while. I used CLIP + BigGAN to make a music video for a song with lyrics made from ImageNet class labels🤠"Be my weasel", performed by me on a looper🎸Code & references available, make your own! Enjoy🤟 piped.video/rR5_emVeyBk
44
107
709
GPT-4 paper literally is just saying "we trained a model on data and it's better". Spread over 98 pages.
72
236
3,003
370,112
🔥EVERYONE🔥We’re excited to announce the release of OpenAssistant. The future of AI development depends heavily on high quality datasets and models being made publicly available, and that’s exactly what this project does. Watch the announcement video: piped.video/ddG2fM9i4Kk
47
468
2,001
928,705
"neural networks focus too much on texture"
The prior in your brain is wrong. This isn’t fried chicken
21
154
1,277
Asking for more regulation is a classic move of market leaders to suppress all competition. How petty of OpenAI to sink to this level.
Sam Altman, CEO of the start-up behind the AI chatbot ChatGPT, agreed with members of the Senate on Tuesday on the need to regulate increasingly powerful AI technology. nyti.ms/435J9Qt
71
158
1,275
211,294
So the M stands for...?
152
38
1,060
276,921
Ok hear me out, each person who leaves OpenAI just has to memorize a billion weights then we get GPT-4
30
39
1,010
99,623
🔥New Video🔥 I delve (ha!) into Byte Latent Transformer: Patches Scale Better Than Tokens where the authors do away with tokenization and create an LLM architecture that operates on dynamically sized "patches" instead of tokens. By controlling the patch size, they gain a level of control over the tradeoff between model size and FLOPs and use that to achieve more favorable scaling behavior than classically tokenized LLMs. Watch here: piped.video/loaTGpqfctI
12
101
1,013
83,809
There are 4 big Machine Learning conferences now: NeurIPS, ICML, ICLR, and Google I/O
9
115
958
This is petty
Elon Musk wanted an OpenAI for-profit. openai.com/index/elon-musk-w…
46
6
971
182,073
it's a doozy
26
115
882
We must urgently stop all further development on this new "keyboard" technology. In the near future, anyone will just be able to type anything!!! The world will be flooded with fake news and civilization will fall😱
48
95
829
277,073
🔥New Video🔥Convolutions are DEAD as Transformers continue to ruin absolutely everything 😱 New SotA on ImageNet, VTAB, etc using only Transformers + massive data 👑 Also Peer Review is broken. Watch Now!👀 piped.video/TrdevFK_am4 @GoogleAI @giffmana @__kolesnikov__ @XiaohuaZhai
18
179
827
NEW & BREAKING: A Sharpie engineer has spent months testing its "pen" product. He's disturbed by the violent/sexual content it can create & Walmart's decision not to take it off the shelves to investigate.
NEW & BREAKING: An Adobe engineer has spent months testing its image-generator software, Photoshop. He's disturbed by the violent/sexual content it can create & Adobe's decision not to take it offline to investigate.
23
66
703
148,082
New Video 🔥 Deep Learning is very good at fitting functions numerically, but what about deriving symbolic expressions? How Graph Networks can learn Newtonian Physics and Dark Matter! piped.video/LMb5tvW-UoQ @MilesCranmer @PeterWBattaglia @KyleCranmer @DavidSpergel @cosmo_shirley
12
155
659
🔥New Video🔥 "Isn't Mamba just a fancy LSTM?" - turns out, there are some key differences! This video is a close look at selective state spaces: piped.video/9dSkvxS2EB0
9
76
645
49,469
"Grokking" is weird: Neural Networks trained to fill in binary operation tables will quickly overfit to the training data, but after many, many steps suddenly "get it" and achieve 100% validation accuracy. piped.video/dND-7llwrpw
16
112
621
Stable Diffusion had a good run. It was the cool new kid on the block. Sadly, it's now in TensorFlow. Have fun with it boomers...
Stable Diffusion implemented using @Tensorflow and #Keras. - Converted pre-trained models - Easy to understand code - Minimal code footprint Code : github.com/divamgupta/stable… Google Colab with @Gradio demo : colab.research.google.com/dr…
21
35
622
This model learns, unsupervised, to translate code from Python to C++, including standard library calls and type inference! 👀 Watch this video to find out how! piped.video/xTzFJIknh7E @MaLachaux @b_roziere @LowikChanussot @GuillaumeLample @facebookai
15
144
572
Another sad day for open source. I personally wrote the first version of token-streaming for this.
Today is a huge milestone for one of our latest libraries: Text Generation Inference - we released v1.0 and under a new license: HFOIL 1.0 github.com/huggingface/text-… This 🧵 explains what this new license means, and why the change!
16
60
565
244,946
On the contrary, people don't forget GPT-2. People vividly remember that, quite unprecedentedly, OpenAI refused to share code or weights for GPT-2 and single-handedly started an era of closed models and commercialism over science.
And just like that, @OpenAI gpt-oss is now the number one trending model on @huggingface, out of almost 2M open models 🚀 People sometimes forget that they've already transformed the field: GPT-2, released back in 2019 is HF's most downloaded text-generation model ever, and Whisper has consistently ranked in the top 5 audio models. Now that they are doubling down on openness, they may completely transform the AI ecosystem, again. Exciting times ahead!
21
33
593
49,680
Men who use LARGE language models, is it possible that you're compensating for something?
42
25
553
59,707
This is the worst AI ever! I trained a language model on 4chan's /pol/ board and the result is.... more truthful than GPT-3?! See how my bot anonymously posted over 30k posts on 4chan and try it yourself. Watch here (warning: may be offensive): piped.video/efPrtcLdcdM
34
76
547
The AI ethics community is dead. It has no more power. This is good because it was never about ethics. Next are the effective altruists, most of whom are neither effective nor altruistic. Sanity will win
31
42
551
56,044
Apparently, Stanford is putting together a strongly worded letter against me. I'm not kidding. A strongly worded letter.
51
7
549
🔥New Video🔥 Flow matching (not classic diffusion) is the basis for state-of-the-art text to image models, like Stable Diffusion 3. Here is how it works: piped.video/7NNxK3CqaDk
4
84
536
40,365
JavaScript be like "==" the same "===" really the same "====" really, actually the same "=====" you won't even believe how the same those things are
11
42
512
Programming is now just arguing with models.
20
42
509
49,085
Conclusion: If we make the car bigger, it will probably work.
When you debug a machine learning model
19
44
511
GPT-3 is out and it is HUGE 🤯 Turns out that a pure Language Model can zero-shot almost any NLP Task! Here's my video summary of this 175 BILLION parameter beast! piped.video/SY5PvZrJhLE @nottombrown @8enmann @AlecRad @Dario_Amodei @arvind_io @girishsastry @AmandaAskell @ilyasut
12
141
521
NeurIPS introduces a track dedicated to advancing kids of rich parents even more than they already are
NeurIPS 2024 will have a track for papers from high schoolers.
19
33
509
93,885
I have a secret for you... #manufacturedoutrage
Will releasing the weights of large language models grant widespread access to pandemic agents? Turns out, yes, probably. 1/5
18
22
494
81,328
According to the current AI landscape, Microsoft Word is Open Source, because I can use it for free as a student.
14
26
462
56,095
🔥New Video🔥 RWKV takes the best of both worlds: Transformers and RNNs and combines them into a scalable architecture that is refreshingly different. This video dives deep into how it works and where its tradeoffs are: piped.video/x8pW19wKfXQ
11
74
471
61,572
To all "it's merit based" responders: if you reward skills before they are introduced in the public school system, the vast majority of rewardees will come from extremely privileged backgrounds that support and incentivize them to acquire those skills privately.
NeurIPS introduces a track dedicated to advancing kids of rich parents even more than they already are
19
31
467
45,860
Shocking: A trained model beats an untrained model. It's 2023 everyone 😁
Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks. The authors introduce Goat, a fine-tuned LLaMA model that significantly outperforms GPT-4 on a range of arithmetic tasks. Fine-tuned on a synthetically generated dataset, Goat achieves state-of-the-art performance on the BIG-bench arithmetic sub-task. In particular, the zero-shot Goat-7B matches or even surpasses the accuracy achieved by the few-shot PaLM-540B. Surprisingly, Goat can achieve near-perfect accuracy on large-number addition and subtraction through supervised fine-tuning only, which is almost impossible with previous pretrained language models, such as Bloom, OPT, GPT-NeoX, etc. Paper page: huggingface.co/papers/2305.1…
14
16
461
85,757
It's surprisingly fun to collect data for OpenAssistant - Our open-source alternative to ChatGPT! Check out the video: piped.video/64Izfm24FKA #openassistant #chatgpt
19
90
462
86,417
Who is behind #StableDiffusion? Check out this interview with @EMostaque , Founder of Stability AI. We chat about open sourcing models, building a giant compute cluster from scratch, and how he envisions a true democratization of AI. piped.video/YQ2QtKcK2dA @StableDiffusion
8
81
448
AI Ethics people just mad I Rick rolled them.
27
15
418
🔥Short Video🔥MLP-Mixer by @GoogleAI already has about 20 GitHub implementations in less than a day. An only-MLP network reaching competitive ImageNet- and Transfer-Performance due to smart weight sharing! Check it! piped.video/7K4Z8RqjWIk @neilhoulsby @giffmana @__kolesnikov__
5
73
437
New Video 🥳 This paper uses lots of compute to learn a single, unified LSTM-based optimizer on over 6000(!) different tasks, then uses that optimizer to TRAIN ITSELF! We're in full meta-land 😱 piped.video/3baFTP0uYOc @GoogleAI @Luke_Metz @niru_m @bucketofkets @poolio @jaschasd
2
94
410
Oh yes, "neural networks", that one algorithm 😁
There are thousands of machine learning algorithms out there, but you'll rarely need more than a handful. A good start: • Linear/Logistic Regression • Decision Trees • Neural Networks • XGBoost • Naive Bayes • PCA • KNN • Random Forests • K-Means
14
20
403
🔥New Video🔥an analysis of @karpathy's talk about Tesla's full self-driving system, using NOTHING BUT VISION🤯 Major themes: Auto-labelling to collect data, careful detection of edge-cases, and the massive benefits of owning the entire pipeline💪 piped.video/9MJTeOaSMTk
5
60
415
A strategy that never fails: Reel them in with the hype, then, when they least expect it, educate them! piped.video/nOBm4aYEYR4
10
28
408
45,150
🔥New Video🔥Decision Transformers gets remarkably good performance on Offline RL by just ditching everything RL and using sequence modeling🤯Check it piped.video/-buULmf7dec @lchen915 @_kevinlu @aravindr93 @kimin_le2 @adityagrover_ @MishaLaskin @pabbeel @AravSrinivas @IMordatch
11
81
397
🔥New Video🔥 LambdaNetworks capture long-range interactions as linear functionals🤯 Super complicated, basically Transformers without the giant memory requirements 🥳 New SotA on ImageNet! 💪 Watch Now! piped.video/3qxJ2WD8p4w #ICLR2021
8
73
405
How do Machine Learners diet? They turn on weight decay.
15
40
396
New Video 🔥 How I Read A Machine Learning Paper Here's my process of reading and understanding the DETR object detection paper in an efficient manner. piped.video/Uumd2zOOz60
9
79
398
🔥Special Video🔥I built a Neural Network in Minecraft 🎲No Mods, No Command Blocks 📶Analog, not Digital ⛏️Backpropagation & Weight Updates ⚙️Fully Automatic 🧑‍💻Open Source This video details what it does, how it works, and how it's built. Don't miss it 😉 piped.video/7OdhtAiPfWY
16
88
385
Replying to @lexfridman
Continued by GPT-3: "2. Those who cannot In the first case, the person is a scientist. In the second case, the person is a journalist."
9
8
391
I am [x] pro vaccine [x] anti excessive government pressure yet when I protest the latter, I'm immediately lumped in with the antivax crowd. My opinion is not registered anywhere because people like me just don't speak up, and I feel I'm not the only one. Who else feels this way?
57
24
387
Join me for this video👯We take a look at @facebookai's DINO architecture, pushing Self-Supervised Learning for Vision Transformers to truly impressive levels🔥🔥🔥 Check it out! piped.video/h3ij3F3cPIk @julienmairal @armandjoulin
3
73
390
I asked this person twice already for an actual, concrete instance of "harm" caused by gpt-4chan, or even a likely one that couldn't be done by e.g. gpt-2 or gpt-j (or a regex for that matter), but I'm being elegantly ignored 🙃
This week an #AI model was released on @huggingface that produces harmful + discriminatory text and has already posted over 30k vile comments online (says its author). This experiment would never pass a human research #ethics board. Here are my recommendations. 1/7
33
19
359
Google has a weird definition of "shared".
1/ In 2021, we shared next-gen language + conversation capabilities powered by our Language Model for Dialogue Applications (LaMDA). Coming soon: Bard, a new experimental conversational #GoogleAI service powered by LaMDA. blog.google/technology/ai/ba…
9
17
377
76,225
New👏Video👏NFNets achieve new ImageNet SotA by DROPPING batchnorm😱They train 9 times faster than EfficientNet and excel at transfer learning🔥Code is available, too💪Watch now & don't miss some spicy comments from me😄 piped.video/rNkHjZtH0RQ @ajmooch @sohamde_ @SamuelMLSmith
13
68
391
New Video 🥳 Modern Hopfield Networks can store & retrieve exponentially many patterns and have a surprising and intricate connection to Transformer Attention Mechanism! 🔥 piped.video/nv6oFDp6rNQ @HRamses2 @MichaelWidrich @milenapavl @SandveGeir @victorgreiff @jbrandi6 @LITAILab
6
89
377
Yesterday I released a video going over V-JEPA, how it works, and why it matters (including a recap of the original JEPA). Watch here: piped.video/7UkJPwz_N_0
4
36
376
45,138
Replying to @percyliang
Could you please at least link the video in the letter so people can make up their own mind?
11
6
346
It's 2025. MLP-Supermixer-200T outperforms every human at every task. ... "bUt DoEs It ReAlLy UnDeRsTaNd AnYtHiNg?"
16
23
360
I've made a video explaining xLSTM. Watch here: piped.video/0OaEv1a5jUM
5
47
362
32,685
🔥New Video🔥GLOM is @geoffreyhinton's new Computer Vision idea🥳The model represents part-whole hierarchies into implicit parse trees via a multi-step attention-based consensus algorithm👀Excited? Me too! Watch the video to find out more!👇 piped.video/cllFzkvrYmE @GoogleAI
8
55
357
🔥New Video🔥 @DeepMind AlphaFold 2 delivers major AI breakthrough in Protein Folding🧬Beats all competition by HUGE margins🤯Watch to learn how AlphaFold 1 works and what we can guess about AlphaFold 2💪 (Hint: Transformers 😉) piped.video/B9PL__gVxLI @demishassabis #AlphaFold2
7
51
350
🔥New Video🔥How to backpropagate through an algorithm? Seems crazy, but this paper shows it's actually possible for a large class of algorithms, such as k-subset, ILP, and many graph algorithms. Watch my (amateur 🙃) attempt at an explanation here: piped.video/W2UT8NjUqrk
2
66
347
Give us the models? 🤷‍♀️
If you are a developer using the @OpenAI API, DALL-E, ChatGPT, etc. what can we do to make the developer experience better? 🧵👇
6
20
340
41,250
Turns out loading models from the hub (or any other place) is ⚠️ NOT SAFE ⚠️ and opens you up to arbitrary code execution by an attacker🤯 Learn how to do it yourself (and how to protect against it) in this video: piped.video/2ethDz9KnLk
9
55
339
New Video 🔥 No more O(N^2) complexity in Transformers: Kernels to the rescue! 🥳 This paper makes Attention linear AND shows an intriguing connection between Transformers and RNNs 💪 piped.video/hAooAOFRsYc @angeloskath @apoorv2904 @nik0spapp @francoisfleuret @EPFL_en @Idiap_ch
4
74
342
🔥New Video🔥This almost seems like magic🪄DeepMind's AlphaTensor finds new algorithms for doing matrix multiplication that use fewer multiplication operations(!) than any algorithm humans have discovered so far. Watch here to see how they do it 👇 piped.video/3N3Bl5AA5QU
4
59
337
How to make your CPU as fast as a GPU? 🔥 Nir Shavit explains how clever algorithms can make use of sparsity in neural networks to deliver unprecedented inference speed, without any need for specialized hardware! Watch here: piped.video/0PAiQ1jTN5k
5
51
329
One month from now: SotA on ImageNet by really large logistic regression on patches.
11
16
324
Computer Vision just got an Upgrade 🔥 SpineNet is a smaller, better and faster replacement to ResNet by @GoogleAI obtained using Neural Architecture Search 💪 Watch the Video 👀 piped.video/qFRfnIRMNlk @Phyyysalis @tanmingxing @YinCui1 @quocleix Thumbnail Art by Lucas Ferreira!
3
71
328
🎉ML News: Generative MEGA-Models🎉 - Google PaLM: Amazing 540B Transformer - OpenAI DALL-E 2: Text-to-Image breakthrough - Open CLIP, open VQGAN diffusion, open datasets - Salesforce CodeGen - ...and the surprises one finds in Zurich 😉 piped.video/RJwPN4qNi_Y
5
46
323
Meta, you did almost everything right. Now grow a pair and keep that demo up.
15
8
306
New Video on the recently released Mixtral of Experts paper. We look into sparse mixture of experts routing, and note the distinct absence of any mention whatsoever of where the training data came from. Watch here: piped.video/mwO6v4BlgZQ
6
34
319
29,314
People who advocate for "safe" LLMs sometimes don't consider what this word means to other people
Just checking in on alignment of LLMs in China, it's going about how you'd expect.
15
20
305
35,681
🥳Special Video🥳You've just started a PhD and have no clue what to do? Welcome to the club🙂A Survival Guide for PhDs in Machine Learning🧑‍🔬How to do topic selection, conferences, paper writing & what I learned from many mistakes👍Watch, Like, Share🔥Thank You piped.video/rHQPBqMULXo
9
61
306
Google search is now illegal
BREAKING: California’s newly passed AI bill requires models trained with over 10^26 flops to — not be fine tunable to create chemical / biological weapons — immediate shut down button — significant paperwork and reporting to govt
14
29
302
50,819
🔥Here we go🔥 The first OpenAssistant models are out! We have collected the most amazing human dataset ever and it shows: This model is really cool! Watch the video to see it in action and come give it a try: piped.video/Hi6cbeBY2oQ
16
60
306
68,393
👉Paper Explained Video👈Today: @DeepMind's new Perceiver model solves Transformers' quadratic bottleneck by using cross-attention into a self-attentive RNN backbone🦴Can attend to 50k pixels at once!👀Watch Now! piped.video/P_xeshTnPZg @drew_jaegle @OriolVinyalsML @joaocarreira
9
53
306
🎉 New Video 🎉 Knowledge Graphs are very expensive to make, they need human experts. Or do they? 🧐 What if we replaced them with BERT or GPT-2? 🤯 Turns out, works really well, all without training! 🥳 piped.video/NAJOZTNkhlI @ChenguangWang @dawnsongtweets @ShawLiu12 #AI #NLP
4
79
293
Oh no! Now we will be flooded with fictional plays that are COMPLETELY MADE UP!!1!
Introducing Dramatron, a new tool for writers to co-write theatre and film scripts with a language model. 🎭 Dramatron can interactively co-create new stories complete with title, characters, location descriptions and dialogue. Try it yourself now: dpmd.ai/dramatron-github
11
16
294
A joke. It's called a joke. Oh the things people can get offended by 🤦🏽‍♀️
This is one reason why people are afraid of contributing to the community -Divam did a great job! spent their time creating something super cool and shared with everyone -Just to have someone come and shi* on their head for no reason! This is very sad! Don't be like that!
35
4
296
🚀New Video🚀 ReST bootstraps its own extended dataset and trains on ever higher-quality subsets of it. Re-using generated data multiple times means an efficiency advantage with respect to Online RL techniques like PPO. Watch here: piped.video/V4dO2pyYGgs
5
55
301
39,285
To make things even better, we are making this entire dataset free and accessible to all who wish to use it. Check it out today at huggingface.co/OpenAssistant ! 🎉
5
43
286
44,466
🔥New Video🔥 EVERYBODY is talking about @OpenAI's new DALL·E model 👀 It takes any piece of text and turns it into an image, absolutely crazy 😱 Watch the video to learn more💪 piped.video/j4xgkjWlfL4 #DALLE @ilyasut @_jongwook_kim @MikhailPavlov5 @gabeeegoooh @scottgray76
6
55
292
🔥New Video🔥 FFT Magic🪄Fourier Neural Operators speed up PDE solvers by orders(!) of magnitude 🤯 Trained once, solve entire PDE families for any discretization!🎉Watch to find out more⏭️ piped.video/IaS72aHrJKE @ZongyiLiCaltech @kazizzad @AnimaAnandkumar @Caltech #ai #science
6
38
291
🔥New Video🔥 Linear Attention! Unbiased Estimator! Random Features! Orthogonal Features! Low Variance! Tight Bounds! Kernels! Backw. Compatible! The PERFORMER has it all🤯 Watch!💪 piped.video/xJrKIPwVwGM @XingyouSong @kchorolab @andreea_gane @lukaszkaiser @dmdohan @CambridgeMLG
4
42
282
There is already a Switzerland of AI. It's called Switzerland
Brexit may yet turn out to have been a good idea, if it means the UK can be the Switzerland of AI.
10
11
290
50,449
No son of a construction worker is just going to randomly start doing ML research if they never hear of it and don't get told that it could be important for their future career, no matter how intelligent the kid is
16
9
275
14,373
Been there done that 😁
Replying to @OpenAI
Bfd, Grok is partnering with 4Chan mfw
12
4
283
40,079
How to make money with NFTs: 1. Buy an NFT 2. Use it as a reminder for the rest of your life to not make shitty decisions.
2
24
279
my_opinions != your_opinions my_opinions = !your_opinions important difference
11
26
270
Ok I get it, I'm not the favorite child 😁
Replying to @lexfridman
Thanks Lex, great video!
10
1
271
🌏New Video🌎 Scaling Transformers to 1 MILLION tokens and beyond. We'll take a look at what lies behind the Recurrent Memory Transformer and see whether it lives up to the hype. Watch here: piped.video/4Cclp6yPDuw
3
35
282
34,852
🎉New Video🎉TransGAN is the first successful attempt at building GANs with NO convolutions🔥Generator and Discriminator are Transformers (of course)👀Watch now to find out what 3 tricks make it all work!🧙(#3 will surprise you ;)) piped.video/R5DiLFOMZrc @CodeTerminator
5
43
275
👉Video Out Now👈Self-Supervised Learning: The Dark Matter of Intelligence by @ylecun,@imisra_,@facebookai: "We believe that SSL is one of the most promising ways to [...] approximate a form of common sense in AI systems."🔥Watch to learn more! piped.video/Ag1bw8MfHGQ
6
54
272
New Video 🥳 Transformers are coming for Images 😱 Axial-DeepLab combine learned Positional Embeddings w/ Axial Attention and get SotA on Segmentation with a fully Attentional model! No Convolutions 🐐 piped.video/hv3UO3G0Ofo @YuilleAlan @imadamtm @JohnsHopkins @GoogleAI
2
70
273
🔥New Video🔥Do Transformers learn universal computation primitives? GPT-2 pre-trained on language can transfer to vision while COMPLETELY FREEZING all attention weights🤯Only .1% of parameters tuned👀 piped.video/Elxn8rS88bI @_kevinlu @adityagrover_ @pabbeel @IMordatch
4
54
269
YouTube's format just doesn't lend itself to educational long-form content anymore. I will henceforth do my paper reviews on TikTok.
13
3
267