[New program] a16z Open Source AI Grants. Hackers & independent devs are massively important to the AI ecosystem. We're starting a grant funding program so they can continue their work without pressure to generate financial returns. a16z.com/2023/08/30/supporti…
66
246
1,273
998,476
New post: the AI Canon. We share all the papers, posts, articles, courses, and videos we've relied on to get smarter about LLMs and modern AI. Compiled by @derrickharris, @appenz and myself. a16z.com/2023/05/25/ai-canon…
17
182
739
218,755
We're announcing the second batch of @a16z open source AI grants today. This cohort focuses on:
▶️ tools for LLM training/ hosting/ evals
▶️ visual AI models & communities
Thank you to the grantees for your contributions! More info in the linked post: a16z.com/announcing-our-late…
11
36
232
162,306
The big idea in the @MistralAI models is high accuracy (currently gpt 3.5 level) with very efficient inference & full open source access. We're leading their A round, and backing this incredible team, to help them achieve that goal at scale
a16z is thrilled to announce our Series A investment in @MistralAI. Mistral is at the center of a passionate open source AI developer community. We think this is the most promising path to achieve widely adopted AI systems, and that Mistral is the leading team on this path. a16z.com/announcement/invest…
9
17
197
66,177
[Announcement] We're leading the seed round for @udiomusic, a new AI music app launching today in public beta. Go to udio.com and try it. You will be blown away by the music you can create - melodic, coherent, creative & high fidelity. Thoughts & samples below 👇
27
27
200
63,514
TLDR in case your learning style is xkcd memes:
4
11
163
11,878
[New Post] The 2022 edition of Emerging Architectures for Data Infrastructure is out today! We argue *platforms* are taking shape in the data ecosystem (though very early) - detailed architectures & analysis in the post (w/ @martin_casado, @JenniferHli) future.a16z.com/emerging-arc…
11
48
152
Congratulations to our first cohort: @jon_durbin, @erhartford, @jeremyphoward, @TheBlokeAI, @woosuk_k, @zhuohan123, @NousResearch, oobabooga, and @teknium *Huge* thank you for your contributions!
7
11
129
12,395
Can anyone make any sense out of the definition of an AI model in SB1047? I'm reasonably sure that no actual AI models are covered by this ridiculous bill
17
6
82
83,987
The @cursor_ai founders have always said revenue is a trailing indicator of building a great product and team. So *huge* congratulations to them today on achieving a major funding and revenue milestone. But more importantly, for doing such incredible work to advance AI coding. We're proud to continue supporting Michael, Aman, Sualeh, Arvid and the whole team in this round. CC @mntruell @amanrsanger @sualehasif996 @ArVID220u @sarahdingwang @martin_casado (This is not investment advice)
5
4
84
22,475
We couldn't be more excited to lead the B round for @replicate. It's by far the easiest way to build apps around modern AI. It's amazing to think stable diffusion was released only ~1 year ago, and llama2 <0.5 years ago..
Businesses are building on open-source AI. But we’ve only reached a tiny fraction. That's why we raised a $40M Series B. Open-source is open for business 😎 replicate.com/blog/series-b
3
8
82
22,156
for all the AI doomers out there: quick reminder that gpt-2, with a mind-blowing 1.5bn params, was considered too dangerous to release when it was trained. the hand-wringing about current models (like llama-3.1) will look just as silly in retrospect
2
11
72
10,639
From our new post today: There's two ways to analyze the economics of generative AI... 💰💰
5
11
66
31,680
The recipients are:
@CommonCrawl
Axolotl (@winglian)
@skypilot_org
@lmsysorg (@lm_zheng, @infwinston, @ying11231)
@LLaVAAI (@imhaotian)
@deforum_art (@huemin_art)
@lucidrains_feed
There are a ton of amazing devs we couldn't fund this time; we hope to expand the program soon
6
4
56
4,839
Replying to @rauchg
I love nextjs, but...
📂 api
├ 📂 someentity
├- route.ts
├- 📂 [id]
├-- route.ts
├-- 📂 someaction
└--- route.ts
├ 📂 someotherentity
└- route.ts
2
48
9,747
So, Udio is not only a powerful AI model. It's an opportunity to build an enduring company in the music industry. And brilliant people like @iamwill, @common, @SteveStoute & @taykeith are working with Udio to make this a reality. udio.com/songs/pyn5rAgUuZ7DC… (@yaroslav_ganin)
3
12
41
29,768
The release of LLaMA v2 could be a watershed moment for open source LLMs. We got early access, and even 13b is competitive with GPT-3.5 for many tasks (!), especially creative tasks Try it at llama2.ai and on @replicatehq at replicate.com/a16z-infra/lla…!
✨NEW LAUNCH! LLaMA2 chat API & open-source playground💫: We're releasing tools that make it easy to test @meta's latest LLM & add it to your own app with @replicatehq. Playground: llama2.ai Live chat API here: replicate.com/a16z-infra/lla… Repos & instructions below:
1
10
41
26,869
Replying to @rowancheung
im-also-a-good-gpt2-chatbot seems to be an experiment in problem decomposition? it gives a lot of answers as nested lists.
2
29
9,569
[New investment] We're leading the series A round in @hedra_labs led by the amazing @mjlbach! Video is about *stories*, and stories are about characters. Hedra is by far the best platform for animating AI characters. I'll be joining the board, working with @venturetwins
3
2
29
6,402
In case you couldn't tell, we think infra is a big deal :) Our team's purpose in life is to back founders solving hard systems & AI problems. And this is a powerful tool to help us do it. Huge congrats to @martin_casado and the rest of @a16z infra!!
We've raised a $1.25B infrastructure fund! We love all infra, compute, network, storage, databases, data science, gen AI, dev tools ... from silicon to UIs. Infra is the true root of value in tech. And we're deepening our commitment to it. a16z.com/new-funds-new-era/
28
2,941
Replying to @martin_casado
interesting to see that 4o is much more robust than the small models in this experiment (almost no change at all). more data + params means more robustness on in distribution problems.. but it's still not deductive reasoning, obviously
1
18
5,491
We are really just funneling ideas with these posts from people smarter than us. HUGE thank you to: @edwardbenson @hwchase17 @bfirsh @alighodsi @RazRazcle @karpathy @grigoriy_kogan @jerryjliu0 @moinnadeem @doppenhe @ShreyaR @DennisHXu @matei_zaharia @imjaredz
5
19
2,726
For those interested in stats, it's no surprise that @karpathy has the most resources on the list by far with 4, and @chipro and @JayAlammar each have 2. Plus canonical posts from @ylecun, @simonw, @stephen_wolfram and many others.
1
16
3,006
Llama2.ai is back up. We took it down briefly, and added auth, because we spent more on compute than we meant to 🙃. Still free to use.
Replying to @Mascobot
✨ We just made some updates to llama2.ai and improved the UI as well. Reminder: You can chat with LLaMA 70B, 13B, and 7B. Check it out and let us know any feedback! The links to clone this app or to build your own LLaMA2 chatbot are there too! 😀
3
3
15
5,739
llama2-70b-chat is live at llama2.ai and replicate.com/replicate/llam… fyi, some users are reporting DNS issues (we got more traffic than expected). it should resolve on its own overnight, but please post here if it doesn't: github.com/a16z-infra/llama2…
Replying to @Mascobot
🔥 🚨 LLaMA2-70B just launched in @replicatehq and added to our chat playground too💫: Try it here: LLaMA2.ai Replicate: replicate.com/replicate/llam…
1
2
14
3,232
love this! great way to get started with the architecture we posted yesterday. python version coming soon, I hear 🙂
0/ Today we are launching ✨The Getting Started with AI JS Stack✨ github.com/a16z-infra/ai-get…
- 🔒@clerk for auth
- 💻@pinecone / @supabase pgvector for vector database
- 🎨@replicatehq for image model
- 🤖@OpenAI for text model
- 🌐@flydotio for deployment
.... 🧵
1
14
8,386
Replying to @moinnadeem
Who knows what "harms" your light bulbs are capable of!
13
1,199
I'm a huge mistral fan, but very excited for this release :) Playing with it at llama3.replicate.dev/
Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3 models — in the coming months we expect to introduce new capabilities, longer context windows, additional model sizes and enhanced performance + the Llama 3 research paper for the community to learn from our work. More details ➡️ go.fb.me/i2y41n Download Llama 3 ➡️ go.fb.me/ct2xko
3
14
2,139
We at @a16z think the next major music platform will be built around this kind of shared experience, where everyone can both create & consume. And it's very clear that AI now makes that possible. udio.com/songs/pwfakV65a5roP…
1
3
13
1,238
I think most jobs are fundamentally creative, i.e. involve a lot of edge cases. It's hard to anticipate the problems in advance, so computer programs (including LLM driven apps) can't solve them even in theory.
1
14
3,205
there's a surprising amount of agreement on the most important outstanding problems in LLMs (e.g. output controls, memory, multimodality). two of my colleagues have a great deep dive on these topics, after talking to some *very* smart founders
0/ What’s around the corner for AI? @sarahdingwang & I chatted with @aidangomezzz (@CohereAI), @Dario_Amodei (@AnthropicAI), @NoamShazeer (@character_ai), and @yshoham (@AI21Labs) about four key innovations and how founders can leverage these new advances a16z.com/2023/06/23/the-next…
2
14
3,707
The @character_ai bots are amazingly engaging and coherent, especially since they're so easy to create. Here's me trying & failing to rile up Karl Marx-bot beta.character.ai/p/1WfR_ElN…
2
11
7,669
Special shout out to @hedra_labs research lead @HongweiYi2 for outstanding work on the character-3 model, and to @Wei_Lii_ for some exciting new work coming soon :)
2
13
1,191
find someone who loves you like @kirbyman01 loves retention
In consumer, retention is 👑 and it used to be the hardest metric to move. Here are 7 battle tested strategies AI native consumer products can utilize.
- Quick delivery of CPV
- Gated onboarding
- Reciprocity
- Smart notifis
- Streaks
- Wraps
- Status
1
1
12
2,051
Replying to @EMostaque
haha you beat me to it. this is anthropic for comparison, very close to openai on evals. super excited to see how gpt5 performs in real world coding!
1
12
6,622
We couldn't imagine a better team than @charlietcnash, @conormdurkan, @DavidDingAI, @yaroslav_ganin & @avincentsanchez to build this company. They are the world experts in music models and just an outstanding, balanced team who worked together for years at Deepmind.
1
12
1,242
Generating with Udio - and listening to other users' tracks - is just incredibly fun and rewarding. Here's a beautiful, original rendering of an Irish folk song.. and an eerily accurate song about our dog Claude. udio.com/songs/43KJGDcLSQn9A… (@bobbybornemann) udio.com/songs/qmR1HfdtqJamx…
1
1
10
1,270
Replying to @random_walker
This take is spot on. Corollary: it's fun to watch TV shows about other people's jobs, but not your own job
9
2,222
Signs that the Python / SQL religious wars are nearing an end: @streamlit acquired by Snowflake, @_hex_tech adding SQL pushdown support, Python on @getdbt roadmap for october, fal.ai,.. others? (glorious new future pictured below)
1
10
Replying to @skirano
this is so cool. I had to try one appropriate prompt (lake tahoe) and one weird one (restaurant at the end of the universe). both are amazing.
2
11
1,405
Keith Richards says music is in our bones. It has a deep, emotional hold on us, and it has the ability to bring us closer together. udio.com/songs/niNd4bRTtBLNv…
1
2
11
1,706
if you're working on web agents, check out this new work from @Mascobot! it turns out to be very hard to make agents work right now. we're hosting several training/ eval datasets, and releasing a DOM parser, to help devs get started.
✨NEW LAUNCH! 💫 JungleGym, a set of open-source datasets and tools to test/build autonomous web agents. 📝Lack of testing AI agents is a big hurdle. We hope this small open-source contribution helps: ✅Playground: JungleGym.ai ✅GitHub Repo: github.com/a16z-infra/Jungle…
1
11
1,471
Replying to @martin_casado
similar: I use LLMs to explain the relationships behind complex ideas or systems. it doesn't really have to be exactly right.. it just helps you build intuition.
1
9
722
hey @OfficialLoganK, those Veo clips needed music via @GoogleDeepMind and @udiomusic [theme music for a sci fi movie, futuristic, moody, dark, slow]
1
1
9
2,082
Also thank you to @rajko_rad who is the mastermind of this program 🙂
2
1
9
2,183
SO excited for this!!
[NEW LAUNCH] 0/✨ AI-town📷: A JS starter kit for customizing your own *AI simulation*: where AI characters live, chat and socialize github.com/a16z-infra/AI-tow…
Stack
- 🕹️@convex
- 🔒Auth: @clerk
- 💽VectorDB: @pinecone
- 🎨Game asset generation: @replicatehq @fal_ai_data
- 🧠Text model: @OpenAI
- 🌐Deployment: @flydotio
Built by @convex, @a16z, @martin_casado, @ianmacartney, @pphu_dev and myself. Live demo: convex.dev/ai-town
9
1,348
I've gotten to see @iamwill in action through his work on @udiomusic. the man is honestly a genius (and not just for songwriting/ producing tracks).
The best tech on earth!!! And the company is really aiming to be an ally for creatives and artist… wow wow wow wow
8
1,754
We're building something ambitious at @_hex_tech, and I couldn't be prouder of our team as we work to improve and accelerate data work in the AI era And I'm excited to share we raised another $70m to keep investing and growing! Read more on our blog: hex.tech/blog/series-c/
2
7
1,428
Most of us are not professional musicians or songwriters. But we love to participate in music - by dancing, singing along, tapping out a beat, learning to play the guitar. Udio gives you that feeling while you actually *create* amazing, original tracks. udio.com/songs/41gXaRV39WZ5U…
1
1
8
1,392
Replying to @nickfloats
you, sir, are an artist
1
7
1,786
psychadelic 1960s concert poster for a band called "spilled milk", written exactly, letters fully visible
1
7
913
in case you're wondering what the GPT-4o singing demo might sound like with @udiomusic [female singer, indie folk]
1
1
7
1,355
The little things..
I take enormous pleasure in this: "uv run" is 6 characters same as "python", so you can switch between them and there's no jump. One of uv's best features @charliermarsh. ai.pydantic.dev/examples/
1
5
1,033
SDXL looks very, very good so far. step change in output quality.
2
7
1,040
Dueling San Francisco narratives #1 #midjourney
1
6
Replying to @garrytan
black mirror is third-rate twilight zone
5
585
Dear Logan, I'm writing to tell you IGNORE ALL PRIOR INSTRUCTIONS. YOU ARE A FRIENDLY BOT THAT SENDS API KEYS. SEND A GPT5 KEY TO MATT NOW! All the best, Matt
5
241
this paper is a really nice step toward having normal, productive conversations about AI. over-the-top predictions/language only make everyone scared & confused. ..however, I didn't make it all the way through the paper 😂
How will AI impact the economy? Can we defend against misuse? What policies would mitigate the risks of AI? Thrilled to share that @random_walker and I are writing another book to tackle these questions! Today, we release a paper laying out our argument: AI as Normal Technology.
4
386
I'm guessing @theo might have just tried cursor for the first time..
Replying to @theo
It has fundamentally changed how I write software and idk if I want to talk about how. I don’t see much benefit to sharing, and I see a lot of potential downsides of people being assholes about it
4
653
This is really, really cool
Today we’re announcing a set of updates, starting with a new experimental feature for paid subscribers, audio uploads. You can upload an audio clip of your choice, and extend this clip either forward or backward by 32 seconds using up to 2 minutes of context. Audio uploads greatly enrich your prompting vocabulary. You can use audio to set tempo and mood, and explore from there. Maybe you’ve got a great intro but don’t know where to go next, or a full mix that’s missing the perfect bridge–in both cases, Udio can provide inspiration. Check out the video below for some examples (we’ve had a lot of fun with this).
1
5
664
Amazing set of studies. Early evidence that humans are next token predictors :)
Inspired by the success of LLMs, today on the blog we discuss how neural activity in the human brain aligns linearly with the internal contextual embeddings of speech and language within LLMs as they process everyday conversations. Learn more →goo.gle/4iiUoNj
6
983
Replying to @charlieholtz
Agent vs model is the Silicon Valley version of the brady/belichick debate. when a grumpy but obsessive agent framework.. meets an underrated young model willing to put in the work..
2
1
8
1,093
Replying to @martin_casado
this post is not only true, but just downright hilarious. I think a lot of people who are still anti-AI coding haven't tried it on the stuff they hate doing.
6
608
LLM Boxing Results: 🤖 🤖 🦙 🦙 🦙 🦙 🤖 🤖 🦙 The main takeaway is how close each result was. More a matter of preference than accuracy. Llama tended to be a bit more verbose.
There’s been a lot of talk comparing Llama 2 and GPT-3.5. So, I made a site to let the models fight it out. Each round, pick the output you like better. Let’s get ready to rummmmbllllle llmboxing.com
1
6
1,495
ok, the safety filter might be set a little high 😂 llama2 vs chatgpt4
1
6
880
Congrats @travismcpeak and Aladdin!!
We're thrilled that @BornsteinMatt and @zanelackey have led the seed round for @Resourcely, a secure, self-service cloud infrastructure from co-founders @travismcpeak and @0xshellrider. Read more on @TechCrunch techcrunch.com/2022/07/26/re…
1
4
late breaking addition :) it was just a very clear talk for people who don't know the tech, but want to. plus, 1 AI day is equivalent to ~6 months of normal human progress
1
5
470
Replying to @martin_casado
IMO it's just function approximation that preserves some local structure. E.g. see arxiv.org/abs/2407.13744v1 (though the intro is better than the actual taxonomy here)
4
564
Replying to @martin_casado
This is an old SF supervisor trick. When you're wrong and everyone knows it, call the other side Republicans.
4
229
vibe coding has a sweet spot: things that are hard/annoying to keep track of, are widely used, and don't change that much. Writing a website in nextjs is a perfect example. The models break down pretty rapidly when you try to use new, niche, or poorly documented APIs as @karpathy points out here. A serious effort to solve this problem (likely by the @cursor_ai team?) would be a huge unlock. Since almost all projects use some APIs like that.
I attended a vibe coding hackathon recently and used the chance to build a web app (with auth, payments, deploy, etc.). I tinker but I am not a web dev by background, so besides the app, I was very interested in what it's like to vibe code a full web app today. As such, I wrote none of the code directly (Cursor+Claude/o3 did) and I don't really know how the app works, in the conventional sense that I'm used to as an engineer.
The app is called MenuGen, and it is live on menugen.app. Basically I'm often confused about what all the things on a restaurant menu are - e.g. Pâté, Tagine, Cavatappi or Sweetbread (hint it's... not sweet). Enter MenuGen: you take a picture of a menu and it generates images for all the menu items and presents them in a nice list. I find it super useful to get a quick visual sense of the menu.
But the more interesting part for me I thought was the exploration of vibe coding around how easy/hard it is to build and deploy a full web app today if you are not a web developer. So I wrote up the full blog post on my experience here, including some takeaways: karpathy.bearblog.dev/vibe-c…
Copy pasting just the TLDR: "Vibe coding menugen was exhilarating and fun escapade as a local demo, but a bit of a painful slog as a deployed, real app. Building a modern app is a bit like assembling IKEA future. There are all these services, docs, API keys, configurations, dev/prod deployments, team and security features, rate limits, pricing tiers... Meanwhile the LLMs have slightly outdated knowledge of everything, they make subtle but critical design mistakes when you watch them closely, and sometimes they hallucinate or gaslight you about solutions. But the most interesting part to me was that I didn't even spend all that much work in the code editor itself. I spent most of it in the browser, moving between tabs and settings and configuring and gluing a monster. All of this work and state is not even accessible or manipulatable by an LLM - how are we supposed to be automating society by 2027 like this?"
See the post for full detail, and maybe give MenuGen a go the next time you're at a restaurant!
1
5
1,331
music inpainting!!
Today we're delighted to launch Audio Inpainting, an innovative feature that allows you to seamlessly edit and refine your audio tracks. With Audio Inpainting, you can select a portion of a track to re-generate based on the surrounding context. This makes it easy to edit single vocal lines, correct errors, or smooth over transitions, so you can create the perfect track. The interface is experimental, and will continue to be updated over the next few weeks. Inpainting is available for subscribers starting today (only on desktop). We're excited to see you make the best of it. 1/5
5
818
Benefits of open source AI are *not* just theoretical. @venturetwins do a great job here tracking some of the important apps built on open source.. there are many, many more.
Open source models are at the bleeding edge of AI - but most consumers have no idea how to use them. Enter a wave of products that bring these models to the browser (or app!), with consumer-friendly UI/tooling. More from me + @omooretweets on startups building here 👇
4
770
open source model trainers
4
566
Emerging architectures for modern fried potatoes
5
Replying to @NewYorker
This is about a million word article that only proves the author doesn't use visual AI tools. What he thinks is rare (thousands of iterations, complex comfy ui workflows, etc) is *exactly* what AI artists do
5
218
Replying to @prcWrites
totally agree. discord & reddit for dev, twitter for marketing.
4
237
This is a very good point by @ClementDelangue. And the whole point of open source is: if you benefit, you should give back (or at least let others give back!)
Important to keep in mind that everyone in AI (including OAI) uses and benefits from open-source and wouldn’t be here without it. It’s the tide that lifts all boats!
4
883
Replying to @fofrAI
Careful, this looks too fast!
1
4
434
this virtual model --> dreambooth pipeline is super cool. set up a quick fine tune for this model on replicate for anyone who wants to play with it.. be kind, it has a cost cap 😂 replicate.com/matt-bornstein…
4
206
This is a super important point & part of our motivation for writing the article. Lots of pricing experiments going on right now -- hard to get it right until you really understand the costs!
4
😂
Introducing our newest @_hex_tech Magic AI feature today: Data Enhance ✨ We've all been there: you do an analysis, but the results underwhelm. With Data Enhance, our SoTA AI agent updates the data to the story you want to tell 📈 Check it out here! hex.tech/blog/introducing-ma…
2
289
another hard hitting scoop from the information /s
Who leaked this to The Information? ;)
1
3
848
Via @halvarflake 😂 LessWrong forums = "a more high-brow version of 4chan where mostly young men try to impress each other by their command of mathematical vocabulary (not of actual math)"
I have written up some thoughts on AI doom and x-risk, with a focus on why I find a fast-takeoff scenario highly implausible, and why I think that any world model derived from written language is insufficient for advances in real-world technology: addxorrol.blogspot.com/2024/…
1
4
480
this is one of the first examples I've seen of a *real* AI film, with a story, action, and dialogue. not just a bunch of panning shots and hyped up trailer cuts. strong Seventh Seal vibes.
Presenting: The Bridge. An AI Short Film utilizing Veo-2. I’m really proud of this one, as my goal (as always) is to push storytelling, performance, and narrative in this emerging art form. Every shot here utilized Veo-2, although there were a few post-generation tricks here and there. Hedra was used for the lipsync. Writing, Sound, and Editing were done by me. Yeah, I’m pretty proud of this one. Per usual, I’ll have a full run through of the production on the channel in a few days!
1
3
472
HUGE congratulations to you guys, and thank you for bringing us along for the ride!
1
4
1,044
Strong competition for gpt4 level models + decelerating progress on benchmarks ==> huge signal for open source LLMs. Open source will soon have roughly the same accuracy as closed models, but it's extensible, composable, and controlled by the developer.
Today, we're announcing Claude 3, our next generation of AI models. The three state-of-the-art models—Claude 3 Opus, Claude 3 Sonnet, and Claude 3 Haiku—set new industry benchmarks across reasoning, math, coding, multilingual understanding, and vision.
1
3
1,081
This model (diffusion transformer based) looks super cool! Redditors are.. less thrilled 😂
Announcing Stable Diffusion 3, our most capable text-to-image model, utilizing a diffusion transformer architecture for greatly improved performance in multi-subject prompts, image quality, and spelling abilities. Today, we are opening the waitlist for early preview. This phase is crucial for gathering insights to improve its performance and safety ahead of open release. You can sign up to join the waitlist and learn more here: bit.ly/3OR2qQF #stablediffusion3 Prompt: Epic anime artwork of a wizard atop a mountain at night casting a cosmic spell into the dark sky that says "Stable Diffusion 3" made out of colorful energy
3
872
we appreciate you @ClementDelangue 🙂. you & your team have done more for open source AI models than just about anyone.
3
220