Asst. Prof of Ling, and Harrington Fellow at @UTAustin. language, concepts, and generalization. also on the site where the sky is blue. Aspiring wugologist

Austin
The compling group at UT Austin (sites.utexas.edu/compling/) is looking for PhD students! Come join me, @kmahowald, and @jessyjli as we tackle interesting research questions at the intersection of ling, cogsci, and ai! Some topics I am particularly interested in:
2
33
118
40,532
NewsπŸ—žοΈ I will return to UT Austin as an Assistant Professor of Linguistics this fall, and join its vibrant community of Computational Linguists, NLPers, and Cognitive Scientists!🀘 Excited to develop ideas about linguistic and conceptual generalization! Recruitment details soon
48
21
285
22,979
πŸ”‘ πŸ—οΈ The Article+Adjective+Numeral+Noun (e.g., β€œa lovely five days”) construction is quite rare, and yet people and LMs know it’s grammatical. What is the key to learning such a rare construction? @kmahowald and I answer this question in the context of LMs in our new paper:
2
36
159
38,879
Our paper on analyzing language modeling acceptability judgments with systematically manipulated contexts was recognized as an outstanding paperβ€”thanks so much to the reviewers and the best paper award committee!
14
11
154
18,349
Now that everything is finally signed, here’s some exciting news: I’ll be joining UT Austin β˜€οΈ as a postdoc with @kmahowald this fall! And then in fall 2024, I’ll join TTIC (@TTIC_Connect) πŸ’¨ as a research assistant professor! Feeling extremely blessed 🐧 1/
26
6
145
24,274
β˜€οΈπŸ‚ -> πŸƒ πŸ™οΈ Some news! I’ve moved to Chicago to start as a [Research [Assistant Professor]] (independent postdoc w/ PI status) at @TTIC_Connect next week! But I’m feeling very bittersweet, because I spent a wonderful 12 months at UT Austin postdoc-ing with @kmahowald
17
4
124
11,211
Yesterday, I successfully defended in the presence of my wonderful friends, colleagues, and family! Nothing has fully sunk in, but I want to take a moment to acknowledge the indispensable role of two people to whom I owe my entire career: @RayzJulia and @AllysonEttinger 1/6
15
6
120
13,999
πŸ§πŸ”‘πŸ€– Can LMs/NNs inform CogSci? This question has been (re)visited by many people across decades. @najoungkim and I contribute to this debate by using NN-based LMs to generate novel experimental hypotheses which can then be tested with humans!
2
14
83
14,917
Twitter (?)'s new logo in a couple lines of latex: \usepackage{bbold} $\mathbb{X}$
4
2
77
29,488
Allyson (@AllysonEttinger), Julia, and I are incredibly honored to receive recognition for COMPS -- thanks so much to the awards committee and the reviewers!
Congratulations to our 2 Best Paper recipients and our Best System Demonstration recipients: docs.google.com/document/d/1… #EACL2023 #eacl #NLProc #bestpapers
14
3
75
20,159
πŸ™ really a lucky cherry on top because working with Kyle itself is an award!
Chuffed that @kanishkamisra on β€œcontrolled rearing” to learn the β€œa beautiful five days in Miami” arxiv.org/abs/2403.19827 construction won an EMNLP Outstanding Paper Award! And delighted that the ACL community saw fit to recognize a paper about an odd little ling construction.
10
3
73
4,284
Looking forward to attending #cogsci2025! I’m especially excited to meet students who will be applying to PhD programs in Computational Ling/CogSci in the coming cycle. Please reach out if you want to meet up and chat! Email is best, but DM also works if you must quick🧡:
1
24
74
11,047
Controlled zero-shot evals have revealed holes in LMs’ ability to robustly extract and use meaning. But what happens when you add experimental context (ICL/instructions)? With @AllysonEttinger & @kmahowald, I explore this in the context of semantic property inheritance: 1/13
2
13
68
19,019
Excited to finally share COMPS, a collection of english minimal pair stimuli to analyze conceptual knowledge in language models! Work with @RayzJulia and @AllysonEttinger, to be presented at EACL (main)! Paper: arxiv.org/abs/2210.01963 Code: github.com/kanishkamisra/com… Thread: ⬇️1/
3
13
64
10,048
*NEW PREPRINT* I built a simple python package called minicons, to facilitate behavioral and representational analyses of transformer LMs. paper: arxiv.org/abs/2203.13112 code: github.com/kanishkamisra/min… paper-experiments: github.com/kanishkamisra/min… 1/n
5
15
63
Excited that this work (w/ @kmahowald ) will be presented at #EMNLP2024! I must note that the ARR/@ReviewAcl experience for this paper was nothing short of excellent -- thanks to a thoughtful bunch of reviewers!
πŸ”‘ πŸ—οΈ The Article+Adjective+Numeral+Noun (e.g., β€œa lovely five days”) construction is quite rare, and yet people and LMs know it’s grammatical. What is the key to learning such a rare construction? @kmahowald and I answer this question in the context of LMs in our new paper:
4
1
61
7,348
Heading to NYC to spend some time at Google Research as a fall intern! Excited to hopefully run into and learn from all the brilliant folks in the NYC NLP x CogSci community as well (psst, hit me up!) Maybe I will also get my 3-month traveling NOVID challenge badge πŸ€·β€β™‚οΈπŸ™
3
1
49
Excited to travel to Boston and present work on category abstraction in LMs w/ the wonderful @najoungkim at @TheBUCLD! Our poster presentation is on Nov 4! 🐈 πŸ†Ž ThreadπŸ‘‡
3
8
44
11,338
#TidyTuesday Submission for this week showing distribution of percentages of people in each state by employment. Data from Kaggle! #rstats #tidyverse #ggridges
6
3
36
#TidyTuesday after a long time! This time visualizing changes in life expectancy in Rwanda and Cambodia, two big outliers in the graph who also suffered in their life expectancy due to genocide :( Feedback welcome! #rstats #tidyverse
2
6
34
My first submission to #TidyTuesday that shows paths of randomly selected countries (to avoid 'bias') in their share of deaths in the top two causes of death. Data from @OurWorldInData #rstats #tidyverse
3
6
36
Implemented a basic feedforward neural network in R using @dvaughan32's rray and the R6 OOP system. Will start working on a blog post (or two) soon. But here's a preview. #rstats
3
2
37
Not one for bean counting, but minicons recently reached 100+ stars, and it's been a great source of fleeting joy in this month of craziness (7 days in)! Thanks everyone for all the support, especially contributors and ppl who use it to teach Comp-Ling classes!
4
2
38
10,591
Interested in evaluating the ability of #NLProc language models to encode properties of everyday concepts such as robin/table/shark? About to/have used the CSLB or McRae property norms dataset? A long cautionary thread (1/10):
2
3
35
Believe it or not, #ACL2023NLP will be my first in-person ACL! Looking forward to meeting old friends and new! 🀩
3
32
5,421
Being super ambiguous (am sure everyone knows what I mean): I have no idea what to look forward to anymore, with the news I see every day. I’m lucky for all the opportunities I’ve gotten and am sad fewer people will be in my position in the future with the way things are going :(
2
34
3,654
You're getting your first united goal this weekend, heard it here first
1
29
Replying to @tallinzen
Seems like the model can’t address the elephant in the prompt :(
1
31
New paper to be presented on Saturday (July 30) at #CogSci2022 @cogsci_soc We (@AllysonEttinger, Julia Rayz, and I) present a paradigm to perform property induction using LLMs! paper: escholarship.org/uc/item/617… (WARNING: long thread 😬)
1
6
31
everyone hates prompt engineering until they actually do it, after which they hate it even more
4
2
31
4,407
Thanks to @drob and @thomasp85 for gganimate and tweenr!
3
8
32
Excited to spend a beautiful five days in miami attending #EMNLP2024! I’m presenting two papers and am looking forward to meeting friends, foes, and undecideds!
2
2
30
1,724
only took ~1 month to appear on arxiv (was on hold for some reason) but @najoungkim's and my BUCLD work is now out (and citable πŸ˜‰) arxiv.org/abs/2312.03708
2
3
30
4,254
despite a professionally awesome year, mental health is down the dumpsters. here’s a yearly reminder to take care of yourselves no matter how many highs you’ve had!
3
1
29
1,980
I will unfortunately have to skip SCiL this year, but I am thrilled to share that Jwalanthi will be presenting this work by her, @Robro612, me, and @kmahowald on a tool that allows you to project contextualized embeddings from LMs to interpretable semantic spaces!
2
3
32
1,263
colm sent us the acceptance announcement / colm sent the acceptance announcement to us
LMs learn argument-based preferences for dative constructions (preferring recipient first when it’s shorter), being quite consistent with humans. Is this from just memorizing the preferences in their training data? New paper w/ @kanishkamisra, @LAWeissweiler, @kmahowald
1
1
32
1,538
Come be my colleague at @TTIC_Connect starting next Fall and apply to their research assistant professor program! Up to 3 years of research funding + no teaching reqs + PI status! I am happy to share any insights I can offer! Deadline is December 1! facapp.ttic.edu/
10
30
8,674
Pleased to announce that our paper on investigating pre-trained language models for conceptual typicality (bird -> robin; furniture -> sofa) has been accepted as a talk to #cogsci2021 @cogsci_soc w/ @AllysonEttinger @RayzJulia (1/5)
3
4
29
Submitted my first code contributing PR to @drob's widyr package, feels great to have started contributing to the #rstats community! Will most probably blog about what I did along with examples! :D
1
23
@judyefan @flxbinder, Jay McClelland, & I are planning to organize an affinity group at @cogsci_soc this year on "Neural Network models of Human Cognition" We are looking for a volunteer to help with discussion groups on zoom/gather for folks attending remotely. DM if interested!
1
5
25
leaving purdue after 9 years (will be back to walk later this year)! It’s been real but looking forward to spending time at my new temp academic home at @UTAustin!
28
4,759
UT’s loss is NYU’s gain!! Sad to be missing Greg at UT but forever excited about what comes out of TAUR Lab!! (And that’s one fantastic logo migration job)
πŸ“’I'm joining NYU (Courant CS + Center for Data Science) starting this fall! I’m excited to connect with new NYU colleagues and keep working on LLM reasoning, reliability, coding, creativity, and more! I’m also looking to build connections in the NYC area more broadly. Please reach out if you're interested in chatting! This move comes after 8 years working with incredible students and collaborators at UT Austin. Thank you to everyone who supported me in my first academic appointment; I look forward to continuing our collaborations but I will miss you! (and the breakfast tacos!)
28
2,323
So excited about this work: taking inspiration from psych and combining it with tools from interpretability research to analyze LMs! Read the thread + paper for the science; in the meantime I will use this space to talk about how awesome it was to work with my co-authors:
How do language models organize concepts and their properties? Do they use taxonomies to infer new properties, or infer based on concept similarities? Apparently, both! 🌟 New paper with my fantastic collaborators @amuuueller and @kanishkamisra!
1
26
1,964
Happy that this too was accepted at #EMNLP2024! Check out the updated arxiv version:
Controlled zero-shot evals have revealed holes in LMs’ ability to robustly extract and use meaning. But what happens when you add experimental context (ICL/instructions)? With @AllysonEttinger & @kmahowald, I explore this in the context of semantic property inheritance: 1/13
2
2
27
2,509
did you all know that CogSci/psychology based evals of LMs only started in 2022????!!!!
2
1
23
2,942
A bunch of amazing people and me @ Versailles!
2
26
3,897
Lucky to be hosted by one of my oldest friends who happens to have this view from his balcony, so blessed!! See you all at #ACL2023NLP tomorrow! ❀️❀️
1
24
3,049
β˜€οΈminicons πŸŒ– now supports sequence scoring with Vision-Language Models!! Looking forward to see how ppl use it (if at all!) -- feedback always welcome! 🐧🐦
4
2
24
2,339
Extremely heartbreaking :( Felix’ work has had a deep impact on me and I had hoped to meet and talk to him some day β€” my heartfelt condolences to his loved ones.. Please check in on your friends :(
I’m really sad that my dear friend @FelixHill84 is no longer with us. He had many friends and colleagues all over the world - to try to ensure we reach them, his family have asked to share this webpage for the celebration of his life: pp.events/felix
1
1
24
2,231
If em dashes are distinctive of LM-generated writing then most of my papers are LM-generated :( not cool
2
1
24
2,342
There's a known bug in how we compute "word" probabilities with subword-based LMs that mark beginnings of words -- as pointed out by @byungdoh (w/ Will Schuler), & @tpimentelms and @clara__meister I'm pleased to announce that minicons now includes a fix which runs batch-wise!
2
3
24
2,128
"Seeing" robins and sparrows may not necessarily make them birdier to LMs! Super excited about this paper -- massive shoutout to all my co-authors, especially @yulu_qin and @dhevarghese for leading the charge!
Does vision training change how language is represented and used in meaningful ways?πŸ€” The answer is a nuanced yes! Comparing VLM-LM minimal pairs, we find that while the taxonomic organization of the lexicon is similar, VLMs are better at _deploying_ this knowledge. [1/9]
4
22
1,348
The graded status of concepts is central to cogsci research -- some items (robins) are more typical members of their category (birds) than are others (penguins). Is this something that is picked up by current #NLProc language models? 1/3
1
2
20
give me logprobs, at least for the inputs tyvm
what would you like openai to build/fix in 2024?
1
21
2,741
Happy to talk about CogSci based eval of nlp systems (perils/non-perils), conceptual meaning understanding, wugs, daxes, and your own research at #ACL2023NLP!
Believe it or not, #ACL2023NLP will be my first in-person ACL! Looking forward to meeting old friends and new! 🀩
1
19
2,028
A welcome change to the CogSci 2022 submission policy! Thanks @cogsci_soc!
1
2
18
2019 #rstats goals: 1. Learn more rcpp! 2. Help improve NLP tools in R (easy access to word and sentence representations would be a start) 3. Start an R seminar at school
1
15
In case people recovering from colm want more lm content:
October 11th at 12:30pm CT: Research@TTIC presents Kanishka Misra (@kanishkamisra) with a talk titled "Controlled Rearing of Language Models can Reveal Linguistic Insight." Please join us in Room 530, or stream via Panopto: buff.ly/3NeZvQS
2
19
2,346
So ARR doesn't want any of us to have a weekend... Got it! 4 days (2 of which = weekend) for rebuttals is a bit too crazy, friends!
2
20
1,760
If the EMNLP presentation format is either oral OR poster, why do the authors of papers that got an oral slot have to make posters? Where will these posters go?
1
1
19
13,155
Replying to @deliprao
The Big Book of Concepts by Greg Murphy (thanks to @LakeBrenden who used it in his course, which is how I stumbled upon the book)
1
1
18
A big todo for minicons! πŸ‘€
Hey #NLProc and #psycholing Twitter :) We found a bug in how we're all computing contextual word probabilities and wrote a paper about it! It's a very easy fix, so please check it out! +@clara__meister
2
18
1,700
Another analysis inspired by @kearneymw 's really quick work on the authorship of the nyt op-ed. Possible improvements: weigh function words more - works wonders in stylometry sometimes, get better data(than tweets) Code: github.com/kanishkamisra/ins…
2
5
17
Go work with the amazing @kmahowald, @jessyjli, and Katrin! Bonus: also get a chance to interact with @gregd_nlp and @eunsolc and their groups!
I (+ others) are accepting grad students here in Austin! We have an exceptionally vibrant comp ling community across linguistics and CS here at UT nlp.utexas.edu/. I’m based in linguistics but happy to work across departments. Topics I can or hope to be able to fund:
2
16
2,582
Week 6 #TidyTuesday submission, this one shows the spread of Starbucks and Dunkin' Donuts coffee chains in the top 10 states in USA by coffee shops. I STREAMED while coding this so if you are interested in watching that, check out twitch.tv/iamasharkskin #rstats #dataviz
2
2
13
can someone pls remove the "em-dashes" feature from all LLMs??? I refuse to give up on 'em
17
1,668
Reducing naacl fomo by staring at Lake Michigan
19
742
Replying to @yoavartzi
colm-base, colm-small, colm-medium, colm-large, … πŸšͺπŸšΆβ€β™‚οΈ
18
519
2-page abstract (and poster) now available on kanishka.website/publication… Also check out the rest of the program curated by BUCLD's hard-working committee (of students, iirc!): bu.edu/bucld/schedule/ Glad that @najoungkim introduced this conf to me!
Excited to travel to Boston and present work on category abstraction in LMs w/ the wonderful @najoungkim at @TheBUCLD! Our poster presentation is on Nov 4! 🐈 πŸ†Ž ThreadπŸ‘‡
3
2
18
3,172
ACL day one = blessed! Finally met @sebschu irl, and two people came to me to say nice things about minicons! πŸ˜‡
16
1,363
I bet the first thing I’ll ctrl f in my finished draft of the dissertation will be β€œlangauge”
1
15
657
can’t be disappointed in your NeurIPS reviews if you don’t submit :thinksmart:
16
2,519
My timeline is full of Game of Thrones posts what's happening guys #GameofThrones
10
colm : collum :: lm : lum?
1
1
14
7,156
Such a timely piece about the role of NNs in CogSci πŸ€–! A bit of a shameless self promo, but also check out recent work from @najoungkim and me on generating experimental hypotheses from LMs (for datives), super relevant to discussion in Section 2.4: arxiv.org/abs/2408.05086
New paper out πŸŽ‰ !! @linguistMasoud and I wrote a perspective on how modern LMs and AI models can be useful for studying language acquisition in kids. Check out the open source paper in Language and Linguistics Compass: dx.doi.org/10.1111/lnc3.7000… and this original thread:
1
3
17
1,808
All I could think about when I saw the high school paper track: Gotta start them young!
2
15
1,436
"I happy to shed lighter"
17
Knew this question would come some day...
2
16
2,450
the colm before the storm
1
1
18
2,571
minicons now supports the `within word l2r' method for masked language model scoring, developed by the amazing @KaufCarina (thanks for the PR!) and @neuranna πŸŽ‰πŸ€–
1
2
15
1,241
lower your expectations and amazing things will happen to you!
1
2
14
2,460
Pleased to announce that my paper with @AllysonEttinger and Julia Rayz has been accepted for publication in Findings of #emnlp2020.
1
2
15
Due to @amuuueller's wise insistence, minicons now supports multiple devices and quantization! You can now quickly run log-prob based evals on huuuge LMs!
1
14
1,154
Why does neurips hide the reviewers' updated scores (or lack thereof)? "this is to maximize misery for everyone" - aptly described by @najoungkim
16
2,095
If you look deep enough, I think that you might find that understanding means: tensor([[-0.0056, -0.0124, -0.0363, ..., 0.0041, 0.0111, 0.0141]])
16
351
another day another minicons update (potentially a significant one for psycholinguists?) "Word" scoring is now a thing! You just have to supply your own splitting function! pip install -U minicons for merriment
1
1
16
591
Replying to @_jennhu
Congrats to both J HUs!
1
15
1,096
I love how the ao-childes validation set ends:
1
13
1,479
Replying to @tallinzen
Some papers that immediately come to mind for CogSci -> NLP: - arxiv.org/abs/2008.01766 - pnas.org/doi/full/10.1073/pn… - all responses to @Joe_Pater’s call for DL x Ling (I guess these would go both ways)
1
1
15
have good academic news but you'd have to go to site with blue colored sky to check it out, sorry
15
583
Come chat with me about robins, penguins, wugs, daxes, and a sprinkle of inverse scaling in LMs at the virtual poster session #3 on gather town! #eacl2023
Presenting COMPS remotely at #EACL this week! Join me at the Virtual Poster Session #3 (May 2 @ 2:15 pm CEST) or come to my talk in the interpretability session (May 3 @ 9 am CEST)
1
15
1,465
soap: sick of ai papers
1
1
13
4,107
If people are just now trying to reach their hotel in Toronto and crying because of the traffic, it’s because of a BeyoncΓ© concert. ur welcome!
2
1
14
2,017
statistics = only fun when it works
1
14
856