6
2
59
39,785
The vibes of this blog post
1/ In 2021, we shared next-gen language + conversation capabilities powered by our Language Model for Dialogue Applications (LaMDA). Coming soon: Bard, a new experimental conversational #GoogleAI service powered by LaMDA. blog.google/technology/ai/ba…
47
972
10,158
1,411,550
We're max 2-3 years out from DALL-E 2 for 3D printing. Literally conjuring objects from incantations.
116
408
4,303
it's important to communicate with your coworkers with kindness and clarity
24
263
3,493
242,369
Made a little CLI that just pipes my programming questions to GPT-3, so I now can ask it stuff when I'm in the command line! LLMs are better than Stack Overflow now — I just ask it, and it gives me a comprehensive answer in one shot, right there in my terminal, in a couple secs.
72
338
3,414
Somewhere nestled deep within the digits of Pi, there exists the full weights of GPT-4, GPT-5, and all future neural networks.
91
148
3,029
602,172
Thiel Fellowship but for paying engineers to drop out of FANG companies
38
104
2,660
NEW PROJECT — I made a "personal search engine" that lets me search all my blogs, tweets, journals, notes, contacts, & more at once 🚀 It's called Monocle, and features a full text search system written in Ink 👇 GitHub ⌨️ github.com/thesephist/monocl… Demo 🔍 monocle.surge.sh/
60
154
1,747
all these fancy nyc cafes with their no laptop policies are getting out of control gonna open a cafe where you can't come in unless you have more than 50 unread emails and a sidebar overflowing w slack notifications and can't leave unless you clear em all out
31
32
1,621
153,469
Life update🎉 I'm very excited to be joining @NotionHQ to continue prototyping and researching ways AI can help us be more creative, thoughtful, and productive! Looking forward to learning from the team and bringing some of my ideas from the past year to a tool loved by many 👇
104
20
1,602
327,575
AI people how are you parsing your PDFs, I need help - TypeScript / Next.js - Both images + text, bonus points for bounding boxes or Markdown - Ideally low latency (<1s for low 100s of pages)
119
49
1,306
301,063
I built a personal chatbot from my personal corpus[1] a couple weeks ago on fully open-source LMs. On a whim I gave it iMessage. Didn't expect the iMessage bit to matter, but it made a huge difference in how it feels to interact. Much more natural. [1] thesephist.com/posts/monocle…
24
96
964
275,414
building bicycles for the mind... 🚴‍♀️
22
34
939
157,054
Diffusion models for handwriting generation! Cool work. arxiv.org/pdf/2011.06704.pdf
25
114
893
Half of @amasad tweets these days are like "Replit users can now run their own fusion reactor from their bedroom. We had this idea last weekend and it took our two engineers two lunch breaks to build. By next month we'll have a million teenage coders generating fusion power."
10
49
873
Code retention/churn over the last ~15 years for Clojure and Scala's codebases. Source: download.clojure.org/papers/…
22
157
858
I've joined @ThriveCapital as EIR & advisor to support founders in understanding and deploying AI thoughtfully while furthering my interpretability and interfaces work. I remain very excited to cheer on Notion's AI team as they push the frontier of applied LLMs. thesephist.com/posts/thrive/
86
10
823
61,348
Small rant about LLMs and how I see them being put, rather thoughtlessly IMO, into productivity tools. 📄 TL;DR — Most knowledge work isn't a text-generation task, and your product shouldn't ship an implementation detail of LLMs as the end-user interface stream.thesephist.com/update…
28
93
787
i'm losing hope rather quickly that reliable multi-hop reasoning could be solved by LLMs alone without some kind of deterministic logic engine the LLM interacts with. multi-hop reasoning is fundamentally not what LLMs are good at, and training it to do so feels asymptotically limited
71
40
776
253,103
Built a token-wise likelihood visualizer for GPT-2 over the weekend. There are some interesting patterns and behaviors you can easily pick up from a visualization like this, like induction heads and which kinds of words/grammar LMs like to guess.
13
77
751
223,875
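A minimal sketch of the quantity a token-wise likelihood visualizer like the one above would plot, assuming you already have the model's per-position logits. The function names and the toy 3-token vocabulary are illustrative assumptions, not the code behind the original demo.

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over one list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def token_log_likelihoods(step_logits, token_ids):
    """Log-probability assigned to each observed token.

    step_logits[i] holds the vocabulary logits used to predict token_ids[i].
    """
    return [log_softmax(logits)[tok] for logits, tok in zip(step_logits, token_ids)]

def perplexity(log_likelihoods):
    """Exponential of the average negative log-likelihood."""
    return math.exp(-sum(log_likelihoods) / len(log_likelihoods))

# Toy example (assumed data): a 3-token vocabulary and two prediction steps.
toy_logits = [[2.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
toy_tokens = [0, 2]
lls = token_log_likelihoods(toy_logits, toy_tokens)
```

A visualizer would then map each value in `lls` to a color or bar height per token; low values mark tokens the model found surprising.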
I desperately need someone to take me out for an evening and say not a single word about anything related to AI. I am drowning.
46
26
708
86,429
it is crazy that sf restaurants are only open from like 5pm to 9pm. how is this a viable business model
49
5
659
80,158
Insane ML paper acronyms continue to proliferate in 2023
16
37
636
99,080
I think too many people talk about how to recreate Bell Labs, and not enough talk about how to recreate OpenAI c. 2015-2019
28
16
602
103,403
y'all need to stop founding LLM infra companies and start going on dates
13
33
600
87,023
Here we go, fine-tuning GPT-3 on the vast majority of my public online writing (half million tokens)...
33
24
600
A tragically underutilized fact in productivity software today is that most people's entire textual datasets for a lifetime can fit in modern PCs' RAM. Just load it up & search it in memory. We don't need to send everything across the planet. Things can be /so much/ faster.
21
46
574
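To make the "load it up & search it in memory" idea concrete, here is a small sketch of an in-memory inverted index. The class name, tokenizer, and AND-only query semantics are assumptions chosen for brevity, not a description of any particular product.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Lowercase and split into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

class InMemoryIndex:
    """Hypothetical minimal index: everything lives in Python dicts in RAM."""

    def __init__(self):
        self.docs = {}                    # doc_id -> original text
        self.postings = defaultdict(set)  # token -> set of doc_ids

    def add(self, doc_id, text):
        self.docs[doc_id] = text
        for tok in tokenize(text):
            self.postings[tok].add(doc_id)

    def search(self, query):
        """Return sorted ids of documents containing every query token."""
        toks = tokenize(query)
        if not toks:
            return []
        sets = [self.postings.get(t, set()) for t in toks]
        return sorted(set.intersection(*sets))

index = InMemoryIndex()
index.add("note-1", "Text embeddings and latent spaces")
index.add("note-2", "Undo/redo in rich text editors")
hits = index.search("text editors")
```

Lookups touch only local dictionaries, so query latency is bounded by memory access rather than a network round trip.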
timeline just forked
10
41
556
54,033
🌈 Research blog: thesephist.com/posts/prism/ Text embeddings contain semantic and structural features that we can now automatically discover with SAEs, and use to build rich, interactive new information interfaces. In this write-up on my SAE experiments and a prototype called Prism, I share everything I learned about training text embedding SAEs and what kinds of features they contain. I also share: 1. A new more precise feature intervention/edit method using inference-time gradient descent to minimize interference. 2. How applying warmup to the sparsity penalty leads to more stable early SAE training. I also talk about two primitives that may be a part of future AI interfaces: detailed decomposition of concepts and styles from an input, and precise steering of high-level semantic edits to text (and in the future, any media). A lot of this work was done Dec-Feb, and in the intervening 5 months there's been so much new momentum in the space particularly around applying SAEs to production models and multimodal models like CLIP. But I think this piece has a unique focus on how interpretability enables new interfaces, and its application to text embeddings.
11
72
568
99,731
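A rough sketch of what "warmup on the sparsity penalty" can look like in an SAE loss. The linear schedule, the default values, and the function names are assumptions for illustration; the linked write-up may use a different schedule or loss formulation.

```python
def sparsity_coeff(step, warmup_steps, target_coeff):
    """Linearly ramp the L1 coefficient from 0 to target_coeff (assumed schedule)."""
    if warmup_steps <= 0:
        return target_coeff
    return target_coeff * min(1.0, step / warmup_steps)

def sae_loss(x, x_hat, features, step, warmup_steps=1000, target_coeff=1e-3):
    """Reconstruction MSE plus a warmed-up L1 penalty on feature activations."""
    mse = sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)
    l1 = sum(abs(f) for f in features)
    return mse + sparsity_coeff(step, warmup_steps, target_coeff) * l1
```

Early in training the penalty is near zero, so the autoencoder can first learn to reconstruct; the sparsity pressure then grows until it reaches its target weight.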
By end of 2024, steering foundation models in latent/activation space will outperform steering in token space ("prompt eng") in several large production deployments. I felt skeptical about this in summer '23, felt vaguely positive in Jan, and now think it's more likely than not, and I'm more optimistic than ever about the direction of my work since 2022 around a joint exploration of interfaces and latent visualization/steering of foundation models. Anthro's published work today is a milestone in a steady march toward this future that started in mid 2023. Anthropic and DM's leadership in this area, combined with lots of community efforts in work like steering vectors, better sparse autoencoders, various image editing UI prototypes all push us toward the future here, but once technical foundations are there, interface will be much more obviously a bottleneck to utility, alignment, and capability. I'm very excited about the interface possibilities this will open up, particularly for multimodal models and creative use cases. For a moment I thought it was possible that dialogue may eat everything. I don't think so anymore. We'll see new universes of possibilities in both. (And if frontier labs don't have serious interface research bets today, this would probably be a good time to reconsider it :-)
21
50
580
239,426
Hypothesis: information work is overwhelmingly bottlenecked on availability of high-signal context more than by correct inference over the context. If right, implies higher ROI-per-flop of context building over pure logical inference. h/t @anandnk24
41
51
550
69,306
Today's experiment 🪄— Inverting OpenAI's text-embedding-ada-002 model to reconstruct input texts from just embeddings. A LOT of interesting tidbits here. I'll begin with these (cherry-picked) samples. Left column is input, middle is reconstructed from each paragraph's embedding only
16
57
540
122,009
👋
7
1
484
59,478
NEW DEMO! Exploring the "length" dimension in the latent space of a language model ✨ By scrubbing up/down across the text, I'm moving this sentence up and down a direction in the embedding space corresponding to text length — producing summaries w/ precise length control (1/n)
14
53
475
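One way to picture "moving a sentence along a direction in embedding space" is plain vector arithmetic. The sketch below assumes the length direction is estimated as a difference of mean embeddings between long and short texts; that estimation method, the function names, and the toy vectors are all assumptions, and the decoder that turns the edited embedding back into text is not shown.

```python
import math

def mean_vector(vectors):
    """Element-wise mean of equally sized vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def length_direction(short_embs, long_embs):
    """Unit vector pointing from the mean 'short' embedding to the mean 'long' one."""
    diff = [l - s for s, l in zip(mean_vector(short_embs), mean_vector(long_embs))]
    norm = math.sqrt(sum(d * d for d in diff))
    if norm == 0.0:
        raise ValueError("short and long embeddings have identical means")
    return [d / norm for d in diff]

def steer(embedding, direction, alpha):
    """Shift an embedding by alpha along the direction (negative alpha shortens)."""
    return [e + alpha * d for e, d in zip(embedding, direction)]

# Toy 2-D embeddings (assumed data).
direction = length_direction([[0.0, 0.0], [0.0, 2.0]], [[3.0, 1.0], [5.0, 1.0]])
longer = steer([1.0, 1.0], direction, 2.0)
```

Scrubbing up or down in the demo would correspond to sweeping `alpha` and decoding each shifted embedding.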
ML research githubs are all like "This repo lets you reproduce and build on our results. Simply run ./scripts/train_best_model.py --with-ffn=32 --gru-cache=100 --magic-sauce --m-dims=3.1415926535 --unicorns=no_exist * --m-dims should be set to exact value of Pi for best results"
10
32
443
some life update🚀 Last week was my last at Ideaflow. Starting 2022, I'm working full-time on building products, prototypes, and experiments investigating how we can build better software tools for creating and thinking. More coming soon, but wanted to get the news out early :)
42
1
472
More people should create things to proliferate an aesthetic into our future, not just to solve problems. This is the quality that every artist and engineer I respect shares most universally. Without this, you are doomed to churning out slop.
31
61
471
52,625
More people should play with base models. They have a distinct feeling of simulation and "intelligence grown from the world" rather than injection-molded from obedience that instruction-tuned models have. They are state-of-the-art world simulators, not question-answerers. Below: some llama3 8b samples conditioned on ~100 words from my blog. To my eyes, they sound like me, and have ideas I'd consider interesting enough to write about. They have my voice and style. Critically, they are super far from anything you'd get from ChatGPT/Claude (last pic), which sounds generic, overabstracted, and nothing like me.
25
37
463
46,882
it's really hard as a young engineer i think to switch from thinking of sw engineering as the practice of building static artifacts (code) to the practice of building and maintaining living systems involving people and running computers. but i think that makes a big difference.
15
32
459
22,357
Single-purpose computers! They're great.
10
10
434
Weird idea: chunk size when doing retrieval-augmented generation is an annoying hyperparam & feels naive to tune it to a global constant value. Could we train an e2e chunking model? i.e. system that takes in a long passage, and outputs a sequence of [span, embedding] pairs?
38
19
449
184,585
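The interface proposed above, a system that maps a long passage to a sequence of [span, embedding] pairs, might be typed roughly as follows. This is only a sketch of the input/output shape: the fixed-size segmenter is the naive baseline the post wants to replace, and `toy_embed` is a hypothetical stand-in for a real embedding model.

```python
from typing import Callable, List, Tuple

Span = Tuple[int, int]            # (start, end) character offsets, end exclusive
Chunk = Tuple[Span, List[float]]  # one [span, embedding] pair

def fixed_size_spans(passage: str, size: int) -> List[Span]:
    """Baseline segmenter: a global constant chunk size."""
    return [(i, min(i + size, len(passage))) for i in range(0, len(passage), size)]

def chunk_and_embed(
    passage: str,
    segment: Callable[[str], List[Span]],
    embed: Callable[[str], List[float]],
) -> List[Chunk]:
    """Pair every span chosen by the segmenter with the embedding of its text."""
    return [((s, e), embed(passage[s:e])) for s, e in segment(passage)]

def toy_embed(text: str) -> List[float]:
    """Hypothetical embedding: just the text length, for demonstration only."""
    return [float(len(text))]

pairs = chunk_and_embed("abcdefghij", lambda p: fixed_size_spans(p, 4), toy_embed)
```

An end-to-end chunking model would replace both `segment` and `embed` with a single learned component that emits the same list of pairs.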
Replying to @mariosangiorgio
This is hilariously smart haha Who needs Ctrl-save when you can just increment the pc manually when your editor hangs.
2
89
405
Wow, I just got @AnthropicAI's sparse autoencoder-based feature decomposition technique to work* for text embeddings 🎆 Screenshot below. In order, this output shows: 1. max-activating examples for that feature from the Minipile dataset 2. min-activating examples from the same dataset 3. Original sentence, and reconstruction from my (dense, non-interpretable) text autoencoder (Contra) 4. Reconstructed text with the feature (a) off, (b) set to the dataset mean (c) set to the dataset max A big blocker to using embeddings for text editing, and for compressing embeddings further, has been that we don't have a good way of knowing *how* to modify an embedding to express or erase a desired feature. This points to a potentially scalable, effective way to do so. * Dense text autoencoder is a Contra-small (330M params). Sparse autoencoder goes from 512 to 4096 hidden dims trained on about 30M embeddings. This is from ~3 days of work, so lots of room for improvement ahead, including making the "works" definition much more robust by showing features discovered this way are more interpretable than raw embedding dimensions. This has been a big goal for my embeddings work this year! Very excited this shows signs of life, and this technique seems to transfer surprisingly well, from LLM activations to embeddings. This, and other non-cherrypicked examples, below:
10
31
422
87,496
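For readers unfamiliar with the setup, here is a toy sketch of editing one sparse-autoencoder feature and decoding the result, mirroring the "feature off / set to a chosen value" experiment described above. The tiny dimensions, hand-picked weights, and function names are assumptions; this shows simple clamping only, not the post's trained 512-to-4096 model.

```python
def matvec(matrix, vector):
    """Multiply a row-major matrix by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]

def encode(x, w_enc, b_enc):
    """Sparse feature activations: ReLU(W_enc x + b_enc)."""
    return [max(0.0, s + b) for s, b in zip(matvec(w_enc, x), b_enc)]

def decode(features, w_dec, b_dec):
    """Map feature activations back to embedding space."""
    return [s + b for s, b in zip(matvec(w_dec, features), b_dec)]

def edit_feature(x, idx, value, w_enc, b_enc, w_dec, b_dec):
    """Encode x, clamp feature idx to value, then decode the edited features."""
    features = encode(x, w_enc, b_enc)
    features[idx] = value
    return decode(features, w_dec, b_dec)

# Toy weights (assumed): 2-D embeddings, 3 features.
W_ENC = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
B_ENC = [0.0, 0.0, 0.0]
W_DEC = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
B_DEC = [0.0, 0.0]

x = [2.0, 3.0]
feature_off = edit_feature(x, 0, 0.0, W_ENC, B_ENC, W_DEC, B_DEC)
```

Setting the value to a dataset mean or max instead of `0.0` gives the other two variants; the edited embedding would then go through a text decoder to produce the modified sentence.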
During my time at @NotionHQ, I got to think deeply about AI's future from the lens of design and architecture, rather than just technology. I ended up writing a piece during my last few weeks about the potential and risks of AI in analogy to plastic 🧱 notion.so/blog/ai-is-the-new…
18
35
398
79,200
heyyyyyyyyyyyy.com is #2 on Hacker News LMAOO I can't 😂
19
14
383
This is your periodic reminder that user interfaces are important, and text is a good lowest common denominator, not the endgame. The world and our senses have a lot more to offer.
16
27
375
57,878
re: chiang— it's futile to argue that a paintbrush, because it does not experience humanity, cannot create art. It makes two category errors: one, mistaking that art is created by tools rather than artists; and two, that art is christened by construction, rather than culture.
12
39
366
34,975
Embedding features learned with sparse autoencoders can make semantic edits to text ✨ (+ a reading/highlighting demo) I've built an interface to explore and visualize GPT-4 labelled features learned from a text embedding model's latent space. Here's a little video, more in 👇
14
37
362
52,614
can we not turn twitter into linkedin actually
4
8
348
This morning, I've been sketching out ideas for a chat interface to language models that treat branching/multiple timelines as a first-class concept and try to make heavily branch-y threads navigable. Some notes I've been taking... thesephist.notion.site/Bette…
20
28
357
70,228
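Treating branches as a first-class concept suggests storing a conversation as a tree rather than a list. A minimal sketch, with a hypothetical `Message` class and method names that are not taken from the linked notes:

```python
class Message:
    """One node in a branching conversation tree."""

    def __init__(self, role, text, parent=None):
        self.role = role
        self.text = text
        self.parent = parent
        self.children = []

    def reply(self, role, text):
        """Add a child message; calling this twice on one node creates a branch."""
        child = Message(role, text, parent=self)
        self.children.append(child)
        return child

    def timeline(self):
        """Messages from the root down to this node: the context for one branch."""
        node, path = self, []
        while node is not None:
            path.append(node)
            node = node.parent
        return list(reversed(path))

root = Message("user", "Name this project")
branch_a = root.reply("assistant", "Monocle")
branch_b = root.reply("assistant", "Prism")
follow_up = branch_a.reply("user", "Why that one?")
```

Each leaf defines its own timeline via `timeline()`, so sending a branch to a model only requires walking from that leaf back to the root.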
Brewing currently 🧪 Exploring a language model's latent space on a connected canvas, branching from a single idea through connections to a tree of alternate realities.
12
28
352
Thinking about notation design again...
11
24
333
42,010
I sat down with @danshipper to talk about how I work! I go through the tools I use for my work and why, focusing on the ones that leverage LLMs to help me read and think. Also a peek into my past prototypes, and recs for books that inspire my work 📚 every.to/superorganizers/lin…
12
30
336
stanford symbolic systems is the hottest major in tech no room for questions
20
8
337
116,752
Replying to @Mantia @eliz_kilic
I don't think a slightly chubbier octothorpe looks bad though if you balance the margins right
16
19
310
How to commit to the right opportunities
7
23
332
A lot of you here, for some reason I don't quite understand, have cared about my work for a long time — far before I thought it became worth listening to. I am reminded today how special that is. It has led to innumerable friendships and opportunities I am lucky to enjoy, which I can't ever forget.
35
3
342
28,416
Things that are horrifically harder than they should be: - Text rendering - Rich text editors - Implementing undo/redo that won't make you pull your hair out (when mixed with autocorrect, formatting, page navigation, etc. etc.) Tonight I'm wrestling with the third, apparently!
12
7
331
Good tools admit virtuosity — they have low floors and high ceilings, and are open to beginners but support mastery, so that experts can deftly close the gap between their taste and their craft. Prompt engineering does not admit virtuosity. We need something better.
9
39
318
Not a single vector DB in sight. I have found peace 🏝️
5
4
328
40,581
Earlier this month, I gave a talk about some of the technology we’re building at Thrive & the philosophy behind it. One focus for us has been internal tools that augment our team's work, including Puck, an in-house research system that runs in chat & ambiently in the background.
14
9
357
96,854
Quick little hack 🦄 — a GPT token probability visualizer Given lots of interest in my little LLM visualization from earlier in the year and a little encouragement from @simonw, I decided to break this out into its own little fully client-side app! 🔗 perplexity.vercel.app/
8
47
319
45,592
It's been a wild week for me. - 2x HN, 2x @ProductHunt - 100k site visits - 1.4k👉2k followers - Good convos w/ founders, VC folks My main takeaway: There is SO MUCH room in the world for projects that don't necessarily aspire to solve the world's problems. Fun is ok, too.
11
8
319
fun discovery: the original "scratchpad" paper that's the closest precursor to o1 (3yrs ago) was an ICLR 2022 rejection openreview.net/forum?id=iedY…
3
29
318
32,431
We don't talk enough about the fact that most creative software on the computer works by simulating a fake piece of paper and a fake typewriter or pen just so we don't have to think of or learn fundamentally new interaction modes for this fundamentally new medium.
19
27
302
You guys are telling me we are going to invent literal superintelligence and we are going to interact with it by sending texts
27
10
293
40,023
Even the best current "tools for thought" apps require you to remember to manually make all the connections between your ideas. Is anyone working on making the computer participate and help in this process? Suggesting connections? Finding missing links? I want to talk to you 👋
41
15
286
Open source: a story in 4 parts
4
15
276
41,623
a beautiful name for a baby boy
10
7
281
26,293
just learned there's no uniqlo in SF???? how do y'all live here
51
1
266
48,376
So I haven't done this before for some reason, but I laid out all my projects listed on thesephist.com/projects/ side by side, and... ... yeah. I've been busy 😂 A little over 120 projects in all, most of them still functional and online! Gotta celebrate milestones sometimes 💪
17
6
268
At @aiDotEngineer this evening, I shared that the text autoencoder model I've been prototyping with, which I call Contra ✨, is on @huggingface! Some starter code + demos👇 Colab notebook — linus.zone/contra-colab Slides — linus.zone/contra-slides Model — linus.zone/contra
15
27
260
89,547
There Are So Many PromptOps Tools And I'm Sold On None Of Them stream.thesephist.com/update…
19
18
264
79,669
if artificial neural networks are a kind of alien intelligence, can we use them to imagine alien languages? how could an NN teach itself to "write down" information without any human priors of what writing looks like?
16
16
256
26,684
A while ago I complained here about persistent storage in Google Colab. Have been using @LightningAI Studios for a while now for: - Full VSCode (incl. GH Copilot) - Persisted files shared across notebooks - Multi-GPU/node (!!) It's been great. Feels like a remote ML workstation
6
33
259
56,269
stand back, I'm a professional -- >>>content.split("...")[2].strip().split('"""')[0].strip().split("\n")
22
6
253
35,010
Think it, make it. Come talk to @NotionHQ at Config for a zine about serious play and creating! ✍️🎨
7
7
257
22,383
mentally i am here
9
20
255
20,739
💭 Synthesizer for thought: thesephist.com/posts/synth/ What kind of weird interfaces may be worth thinking about in a world where we can interact with ideas as mathematical objects? We might find an interesting analogy in synthesizers, a tool to create and interact with sound as mathematical objects that has opened up new genres and practices in music. A synthesizer produces music very differently than an acoustic instrument. It produces music at the lowest level of abstraction, as mathematical models of sound waves. It’s a way of producing sound by assembling it from logical components rather than creating it wholesale by hitting or vibrating something natural. The synthesizer is just one example of a pattern in the history of media: with breakthroughs in mathematical understanding of a medium, come new tools that exploit that mathematical understanding to enable new creative forms and human interfaces. What could that look like for writing and thinking?
6
26
257
22,569
Most people don't want a Photoshop for Stable Diffusion; they want an Instagram. stream.thesephist.com/update…
10
11
255
Some of the bigger pieces on the board seem to be moving...
9
6
251
30,368
what if dialogue with language models weren't so asymmetric? i've been experimenting with steering & finetuning models to evoke more "texting a friend" less "assistant / therapist". (i'm on the right in these screenshots, finetuned LLM on the left)
18
10
249
28,800
Sometimes I feel like there are two visions of the future at the edges of tech right now: To engineer scarcity into everything (crypto) To engineer scarcity out of everything (generative AI) Cyberpunk vs. solarpunk. Singularity vs. singularity.
12
32
237
Today @jasonyuan was like "I'm gonna show you a quick 5sec demo" and I saw it and I don't think I'll ever look at software the same again.
10
2
249
184,531
I wrote a bit about the "ChatGPT voice", "Midjourney style", why they happen, and some ways out that I can imagine. 🔗 thesephist.com/posts/epistem…
10
13
246
40,466
Please dear god somebody make something that doesn't need a model picker
21
5
248
19,587
Thinking about launching an OnlyFans but you get to see all my private repositories on GitHub instead 💋
9
17
239
Thinking about Makepad's continuous code folding animation again. Feels like we should be able to do this with prose text now — find the key ideas/sentences and zoom out the rest of a document.
6
18
244
26,428
Thinking about building a "personal search engine" A search engine that only indexes my blog, my Tweets, my journal, my calendar/email and contacts, my photos, and browser history. I want to have better memory without having to remember more stuff. What else should it index?
36
4
244
RT if you're also a grandpa
4
10
240
Technical talk doesn't have to mean slides that look like you just picked a default template from canva/keynote.
15
5
242
20,163
Some thoughts on building trustable agents, and complete task delegation to LLMs. stream.thesephist.com/update…
18
19
235
34,384
Encouraged by some conversations I've had recently, I put together a list of links/papers/reports you might find interesting if you like my work. Covers interpretability + model visualization, interface thinking, stories/fiction. I'll be adding more. notion.notion.site/notion/Li…
5
25
226
20,505
Epoch AI seems to be doing a lot of great work across the board, from FrontierMath to their detailed newsletters. Impressive example of clear-eyed field building. Very little hype, lots of substance.
14
223
12,843
Back in 2022 in my ✨experimental✨ era I wrote down a whole bunch of ideas for tools and interfaces I want to make, but didn't get to actually prototype many of them. Here's a thread of the ones I think would still be interesting, starting with this weird mobile browser concept.
3
14
231
31,374
*sits down for a date* "So, I'm training a chinchilla-optimal open source 7B parameter LLM."
17
4
226
36,826
to date, this is still the best demo I've built/found to explain to folks outside of NLP how an LLM works. Interactively visualizing autoregressive sampling from a GPT-style model.
Built a token-wise likelihood visualizer for GPT-2 over the weekend. There are some interesting patterns and behaviors you can easily pick up from a visualization like this, like induction heads and which kinds of words/grammar LMs like to guess.
4
28
217
44,833
this thought brought to you from the hopeless trenches of prompt engineering
10
2
214
15,272
entering a group chat should feel like this
8
13
216
20,811
This is a really interesting way to visualize QKV attention! I don't think I've seen it anywhere else. The embeddings as visualized here are kind of useless but combined with sparse autoencoder-based features from more recent work, might be interesting? source: ChemBERTa paper
4
29
215
37,244