Kanishka Misra 🌊 (@kanishkamisra) | nitter

Pinned Tweet

Kanishka Misra 🌊@kanishkamisra

30 Sep 2025

The compling group at UT Austin (sites.utexas.edu/compling/) is looking for PhD students! Come join me, @kmahowald, and @jessyjli as we tackle interesting research questions at the intersection of ling, cogsci, and ai! Some topics I am particularly interested in:

Picture of the UT Tower with "UT Austin Computational Linguistics" written in bigger font, and "Humans processing computers processing human processing language" in smaller font

ALT Picture of the UT Tower with "UT Austin Computational Linguistics" written in bigger font, and "Humans processing computers processing human processing language" in smaller font

2

33

118

40,532

Kanishka Misra 🌊@kanishkamisra

2 Jun 2025

News🗞️ I will return to UT Austin as an Assistant Professor of Linguistics this fall, and join its vibrant community of Computational Linguists, NLPers, and Cognitive Scientists!🤘 Excited to develop ideas about linguistic and conceptual generalization! Recruitment details soon

Picture of the UT Tower taken by me on my first day at UT as a postdoc in 2023!

ALT Picture of the UT Tower taken by me on my first day at UT as a postdoc in 2023!

48

21

285

22,979

Kanishka Misra 🌊@kanishkamisra

2 Apr 2024

🔑 🗝️ The Article+Adjective+Numeral+Noun (e.g., “a lovely five days”) construction is quite rare, and yet people and LMs know it’s grammatical. What is the key to learning such a rare construction? @kmahowald and I answer this question in the context of LMs in our new paper:

Title page of our paper: “Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs” - Kanishka Misra and Kyle Mahowald

ALT Title page of our paper: “Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs” - Kanishka Misra and Kyle Mahowald

2

36

159

38,879

Kanishka Misra 🌊@kanishkamisra

11 Jul 2023

Our paper on analyzing language modeling acceptability judgments with systematically manipulated contexts was recognized as an outstanding paper—thanks so much to the reviewers and the best paper award committee!

14

11

154

18,349

Kanishka Misra 🌊@kanishkamisra

1 Aug 2023

Now that everything is finally signed, here’s some exciting news: I’ll be joining UT Austin ☀️ as a postdoc with @kmahowald this fall! And then in fall 2024, I’ll join TTIC (@TTIC_Connect) 💨 as a research assistant professor! Feeling extremely blessed 🐧 1/

26

6

145

24,274

Kanishka Misra 🌊@kanishkamisra

26 Aug 2024

☀️🐂 -> 🍃 🏙️ Some news! I’ve moved to Chicago to start as a [Research [Assistant Professor]] (independent postdoc w/ PI status) at @TTIC_Connect next week! But I’m feeling very bittersweet, because I spent a wonderful 12 months at UT Austin postdoc-ing with @kmahowald

17

4

124

11,211

Kanishka Misra 🌊@kanishkamisra

21 Jul 2023

Yesterday, I successfully defended in the presence of my wonderful friends, colleagues, and family! Nothing has fully sunk in, but I want to take a moment to acknowledge the indispensable role of two people to whom I owe my entire career: @RayzJulia and @AllysonEttinger 1/6

Dissertation title slide with title: "On Semantic Cognition, Inductive Generalization, and Language Models"

ALT Dissertation title slide with title: "On Semantic Cognition, Inductive Generalization, and Language Models"

15

6

120

13,999

Kanishka Misra 🌊@kanishkamisra

21 Aug 2024

🧐🔡🤖 Can LMs/NNs inform CogSci? This question has been (re)visited by many people across decades. @najoungkim and I contribute to this debate by using NN-based LMs to generate novel experimental hypotheses which can then be tested with humans!

Title of the paper: Generating novel experimental hypotheses from language models: A case study on cross-dative generalization; along with a figure describing our overall method

ALT Title of the paper: Generating novel experimental hypotheses from language models: A case study on cross-dative generalization; along with a figure describing our overall method

2

14

83

14,917

Kanishka Misra 🌊@kanishkamisra

24 Jul 2023

Twitter (?)'s new logo in a couple lines of latex: \usepackage{bbold} $\mathbb{X}$

ALT mirror image of twitter's logo using the bbold latex package.

4

2

77

29,488

Kanishka Misra 🌊@kanishkamisra

4 May 2023

Allyson (@AllysonEttinger), Julia, and I are incredibly honored to receive recognition for COMPS -- thanks so much to the awards committee and the reviewers!

eaclmeeting @eaclmeeting

4 May 2023

Congratulations to our 2 Best Paper recipients and our Best System Demonstration recipients: docs.google.com/document/d/1… #EACL2023 #eacl #NLProc #bestpapers

14

3

75

20,159

Kanishka Misra 🌊@kanishkamisra

14 Nov 2024

🙏 really a lucky cherry on top because working with Kyle itself is an award!

Kyle Mahowald @kmahowald

14 Nov 2024

Chuffed that @kanishkamisra on “controlled rearing” to learn the “a beautiful five days in Miami” arxiv.org/abs/2403.19827 construction won an EMNLP Outstanding Paper Award! And delighted that the ACL community saw fit to recognize a paper about an odd little ling construction.

10

3

73

4,284

Kanishka Misra 🌊@kanishkamisra

28 Jul 2025

Looking forward to attending #cogsci2025! I’m especially excited to meet students who will be applying to PhD programs in Computational Ling/CogSci in the coming cycle. Please reach out if you want to meet up and chat! Email is best, but DM also works if you must quick🧵:

Placeholders for 3 students (number arbitrarily chosen) and me - to signify my eventual group!

ALT Placeholders for 3 students (number arbitrarily chosen) and me - to signify my eventual group!

1

24

74

11,047

Kanishka Misra 🌊@kanishkamisra

12 Jan 2024

Controlled zero-shot evals have revealed holes in LMs’ ability to robustly extract and use meaning. But what happens when you add experimental context (ICL/instructions)? With @AllysonEttinger & @kmahowald, I explore this in the context of semantic property inheritance: 1/13

Paper title: Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently
Authors: Kanishka Misra, Allyson Ettinger, Kyle Mahowald

ALT Paper title: Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently Authors: Kanishka Misra, Allyson Ettinger, Kyle Mahowald

2

13

68

19,019

Kanishka Misra 🌊@kanishkamisra

20 Feb 2023

Excited to finally share COMPS, a collection of english minimal pair stimuli to analyze conceptual knowledge in language models! Work with @RayzJulia and @AllysonEttinger, to be presented at EACL (main)! Paper: arxiv.org/abs/2210.01963 Code: github.com/kanishkamisra/com… Thread: ⬇️1/

3

13

64

10,048

Kanishka Misra 🌊@kanishkamisra

25 Mar 2022

*NEW PREPRINT* I built a simple python package called minicons, to facilitate behavioral and representational analyses of transformer LMs. paper: arxiv.org/abs/2203.13112 code: github.com/kanishkamisra/min… paper-experiments: github.com/kanishkamisra/min… 1/n

Title: minicons: Enabling Flexible Behavioral and Representational Analyses of Transformer Language Models.
Author: Kanishka Misra (email: kmisra@purdue.edu)

ALT Title: minicons: Enabling Flexible Behavioral and Representational Analyses of Transformer Language Models. Author: Kanishka Misra (email: kmisra@purdue.edu)

5

15

63

Kanishka Misra 🌊@kanishkamisra

23 Sep 2024

Excited that this work (w/ @kmahowald ) will be presented at #EMNLP2024! I must note that the ARR/@ReviewAcl experience for this paper was nothing short of excellent -- thanks to a thoughtful bunch of reviewers!

Kanishka Misra 🌊@kanishkamisra

2 Apr 2024

🔑 🗝️ The Article+Adjective+Numeral+Noun (e.g., “a lovely five days”) construction is quite rare, and yet people and LMs know it’s grammatical. What is the key to learning such a rare construction? @kmahowald and I answer this question in the context of LMs in our new paper:

Title page of our paper: “Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs” - Kanishka Misra and Kyle Mahowald

ALT Title page of our paper: “Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs” - Kanishka Misra and Kyle Mahowald

4

1

61

7,348

Kanishka Misra 🌊@kanishkamisra

10 Aug 2022

Heading to NYC to spend some time at Google Research as a fall intern! Excited to hopefully run into and learn from all the brilliant folks in the NYC NLP x CogSci community as well (psst, hit me up!) Maybe I will also get my 3-month traveling NOVID challenge badge 🤷‍♂️🙏

3

1

49

Kanishka Misra 🌊@kanishkamisra

1 Nov 2023

Excited to travel to Boston and present work on category abstraction in LMs w/ the wonderful @najoungkim at @TheBUCLD! Our poster presentation is on Nov 4! 🐈 🆎 Thread👇

Screenshot of the title of our BUCLD abstract: "Abstraction via exemplars? A representational case study on lexical category inference in BERT"

ALT Screenshot of the title of our BUCLD abstract: "Abstraction via exemplars? A representational case study on lexical category inference in BERT"

3

8

44

11,338

Kanishka Misra 🌊@kanishkamisra

30 Apr 2018

#TidyTuesday Submission for this week showing distribution of percentages of people in each state by employment. Data from Kaggle! #rstats #tidyverse #ggridges

6

3

36

Kanishka Misra 🌊@kanishkamisra

5 Jul 2018

#TidyTuesday after a long time! This time visualizing changes in life expectancy in Rwanda and Cambodia, two big outliers in the graph who also suffered in their life expectancy due to genocide :( Feedback welcome! #rstats #tidyverse

2

6

34

Kanishka Misra 🌊@kanishkamisra

17 Apr 2018

My first submission to #TidyTuesday that shows paths of randomly selected countries (to avoid 'bias') in their share of deaths in the top two causes of death. Data from @OurWorldInData #rstats #tidyverse

3

6

36

Kanishka Misra 🌊@kanishkamisra

20 Jun 2019

Implemented a basic feedforward neural network in R using @dvaughan32's rray and the R6 OOP system. Will start working on a blog post (or two) soon. But here's a preview. #rstats

3

2

37

Kanishka Misra 🌊@kanishkamisra

7 Jun 2024

Not one for bean counting, but minicons recently reached 100+ stars, and it's been a great source of fleeting joy in this month of craziness (7 days in)! Thanks everyone for all the support, especially contributors and ppl who use it to teach Comp-Ling classes!

screenshot from github (dark mode) showing 101 stars for minicons

ALT screenshot from github (dark mode) showing 101 stars for minicons

4

2

38

10,591

Kanishka Misra 🌊@kanishkamisra

12 Oct 2022

Interested in evaluating the ability of #NLProc language models to encode properties of everyday concepts such as robin/table/shark? About to/have used the CSLB or McRae property norms dataset? A long cautionary thread (1/10):

2

3

35

Kanishka Misra 🌊@kanishkamisra

8 Jul 2023

Believe it or not, #ACL2023NLP will be my first in-person ACL! Looking forward to meeting old friends and new! 🤩

3

32

5,421

Kanishka Misra 🌊@kanishkamisra

23 May 2025

Being super ambiguous (am sure everyone knows what I mean): I have no idea what to look forward to anymore, with the news I see every day. I’m lucky for all the opportunities I’ve gotten and am sad fewer people will be in my position in the future with the way things are going :(

2

34

3,654

Kanishka Misra 🌊@kanishkamisra

27 Mar 2019

Replying to @DalotDiogo @selecaoportugal @ManUtd

You're getting your first united goal this weekend, heard it here first

1

29

Kanishka Misra 🌊@kanishkamisra

26 Jan 2018

Replying to @ClockedPatch @hassaanishere @SumaaaaiL

He won TI..

2

33

Kanishka Misra 🌊@kanishkamisra

27 Oct 2022

Replying to @tallinzen

Seems like the model can’t address the elephant in the prompt :(

1

31

Kanishka Misra 🌊@kanishkamisra

24 Jul 2022

New paper to be presented on Saturday (July 30) at #CogSci2022 @cogsci_soc We (@AllysonEttinger, Julia Rayz, and I) present a paradigm to perform property induction using LLMs! paper: escholarship.org/uc/item/617… (WARNING: long thread 😬)

1

6

31

Kanishka Misra 🌊@kanishkamisra

17 Oct 2023

everyone hates prompt engineering until they actually do it, after which they hate it even more

4

2

31

4,407

Kanishka Misra 🌊@kanishkamisra

6 Apr 2024

Meta-reviewing some (potentially related) papers + spending more time with folks working on discourse has made me appreciate this great paper by @sebschu and @tallinzen even more: aclanthology.org/2022.naacl-…

When a sentence does not introduce a discourse entity, Transformer-based models still sometimes...

Sebastian Schuster, Tal Linzen. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2022.

aclanthology.org

2

2

32

8,155

Kanishka Misra 🌊@kanishkamisra

28 Jun 2017

Thanks to @drob and @thomasp85 for gganimate and tweenr!

3

8

32

Kanishka Misra 🌊@kanishkamisra

14 Jun 2023

10/10 title arxiv.org/abs/2305.19650

1

1

31

4,805

Kanishka Misra 🌊@kanishkamisra

11 Nov 2024

Excited to spend a beautiful five days in miami attending #EMNLP2024! I’m presenting two papers and am looking forward to meeting friends, foes, and undecideds!

photo of miami (4th result on google images)

ALT photo of miami (4th result on google images)

2

2

30

1,724

Kanishka Misra 🌊@kanishkamisra

15 Dec 2023

only took ~1 month to appear on arxiv (was on hold for some reason) but @najoungkim's and my BUCLD work is now out (and citable 😉) arxiv.org/abs/2312.03708

2

3

30

4,254

Kanishka Misra 🌊@kanishkamisra

14 Dec 2023

despite a professionally awesome year, mental health is down the dumpsters. here’s a yearly reminder to take care of yourselves no matter how many highs you’ve had!

3

1

29

1,980

Kanishka Misra 🌊@kanishkamisra

17 Jul 2025

I will unfortunately have to skip SCiL this year, but I am thrilled to share that Jwalanthi will be presenting this work by her, @Robro612, me, and @kmahowald on a tool that allows you to project contextualized embeddings from LMs to interpretable semantic spaces!

Title page of SCIL extended abstract titled: semantic-features: A User-Friendly Tool for Studying Contextual Word Embeddings in Interpretable Semantic Spaces

ALT Title page of SCIL extended abstract titled: semantic-features: A User-Friendly Tool for Studying Contextual Word Embeddings in Interpretable Semantic Spaces

2

3

32

1,263

Kanishka Misra 🌊@kanishkamisra

8 Jul 2025

colm sent us the acceptance announcement / colm sent the acceptance announcement to us

Qing Yao @qyao23

31 Mar 2025

LMs learn argument-based preferences for dative constructions (preferring recipient first when it’s shorter), being quite consistent with humans. Is this from just memorizing the preferences in their training data? New paper w/ @kanishkamisra, @LAWeissweiler, @kmahowald

examples from direct and prepositional object datives with short-first and long-first word orders:
DO (long first): She gave the boy who signed up for class and was excited it.
PO (short first): She gave it to the boy who signed up for class and was excited.
DO (short first): She gave him the book that everyone was excited to read.
PO (long-first): She gave the book that everyone was excited to read to him.

ALT examples from direct and prepositional object datives with short-first and long-first word orders: DO (long first): She gave the boy who signed up for class and was excited it. PO (short first): She gave it to the boy who signed up for class and was excited. DO (short first): She gave him the book that everyone was excited to read. PO (long-first): She gave the book that everyone was excited to read to him.

1

1

32

1,538

Kanishka Misra 🌊@kanishkamisra

27 Oct 2023

Come be my colleague at @TTIC_Connect starting next Fall and apply to their research assistant professor program! Up to 3 years of research funding + no teaching reqs + PI status! I am happy to share any insights I can offer! Deadline is December 1! facapp.ttic.edu/

10

30

8,674

Kanishka Misra 🌊@kanishkamisra

16 Apr 2021

Pleased to announce that our paper on investigating pre-trained language models for conceptual typicality (bird -> robin; furniture -> sofa) has been accepted as a talk to #cogsci2021 @cogsci_soc w/ @AllysonEttinger @RayzJulia (1/5)

3

4

29

Kanishka Misra 🌊@kanishkamisra

6 Feb 2018

Submitted my first code contributing PR to @drob's widyr package, feels great to have started contributing to the #rstats community! Will most probably blog about what I did along with examples! :D

1

23

Kanishka Misra 🌊@kanishkamisra

24 May 2022

@judyefan @flxbinder, Jay McClelland, & I are planning to organize an affinity group at @cogsci_soc this year on "Neural Network models of Human Cognition" We are looking for a volunteer to help with discussion groups on zoom/gather for folks attending remotely. DM if interested!

1

5

25

Kanishka Misra 🌊@kanishkamisra

6 Aug 2023

leaving purdue after 9 years (will be back to walk later this year)! It’s been real but looking forward to spending time at my new temp academic home at @UTAustin!

28

4,759

Kanishka Misra 🌊@kanishkamisra

12 Aug 2025

UT’s loss is NYU’s gain!! Sad to be missing Greg at UT but forever excited about what comes out of TAUR Lab!! (And that’s one fantastic logo migration job)

Greg Durrett

@gregd_nlp

11 Aug 2025

📢I'm joining NYU (Courant CS + Center for Data Science) starting this fall! I’m excited to connect with new NYU colleagues and keep working on LLM reasoning, reliability, coding, creativity, and more! I’m also looking to build connections in the NYC area more broadly. Please reach out if you're interested in chatting! This move comes after 8 years working with incredible students and collaborators at UT Austin. Thank you to everyone who supported me in my first academic appointment; I look forward to continuing our collaborations but I will miss you! (and the breakfast tacos!)

28

2,323

Kanishka Misra 🌊@kanishkamisra

8 Nov 2024

So excited about this work: taking inspiration from psych and combining it with tools from interpretability research to analyze LMs! Read the thread + paper for the science; in the meantime I will use this space to talk about how awesome it was to work with my co-authors:

Juan Diego Rodríguez (he/him)@juand_r_nlp

8 Nov 2024

How do language models organize concepts and their properties? Do they use taxonomies to infer new properties, or infer based on concept similarities? Apparently, both! 🌟 New paper with my fantastic collaborators @amuuueller and @kanishkamisra!

Title: "Characterizing the Role of Similarity in the Property Inferences of Language Models"
Authors: Juan Diego Rodriguez, Aaron Mueller, Kanishka Misra

Left figure: "Given that dogs are daxable, is it true that corgis are daxable?" A language model could answer this either using taxonomic relations, illustrated by a taxonomy dog-corgi, dog-mutt, canine-wolf, etc., or by similarity relations (dogs are more similar to corgis than cats, wolves or shar peis).

Right figure: illustration of the causal model (and an example intervention) for distributed alignment search (DAS), which we used to find a subspace in the network responsible for property inheritance behavior. The bottom nodes are "property", "premise concept (A)" and "conclusion concept (B)" , the middle nodes are "A has property P", "B is a kind of A", and the top node is "B has property P".

ALT Title: "Characterizing the Role of Similarity in the Property Inferences of Language Models" Authors: Juan Diego Rodriguez, Aaron Mueller, Kanishka Misra Left figure: "Given that dogs are daxable, is it true that corgis are daxable?" A language model could answer this either using taxonomic relations, illustrated by a taxonomy dog-corgi, dog-mutt, canine-wolf, etc., or by similarity relations (dogs are more similar to corgis than cats, wolves or shar peis). Right figure: illustration of the causal model (and an example intervention) for distributed alignment search (DAS), which we used to find a subspace in the network responsible for property inheritance behavior. The bottom nodes are "property", "premise concept (A)" and "conclusion concept (B)" , the middle nodes are "A has property P", "B is a kind of A", and the top node is "B has property P".

1

26

1,964

Kanishka Misra 🌊@kanishkamisra

18 Oct 2024

Happy that this too was accepted at #EMNLP2024! Check out the updated arxiv version:

Kanishka Misra 🌊@kanishkamisra

12 Jan 2024

Controlled zero-shot evals have revealed holes in LMs’ ability to robustly extract and use meaning. But what happens when you add experimental context (ICL/instructions)? With @AllysonEttinger & @kmahowald, I explore this in the context of semantic property inheritance: 1/13

Paper title: Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently
Authors: Kanishka Misra, Allyson Ettinger, Kyle Mahowald

ALT Paper title: Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently Authors: Kanishka Misra, Allyson Ettinger, Kyle Mahowald

2

2

27

2,509

Kanishka Misra 🌊@kanishkamisra

23 Oct 2023

did you all know that CogSci/psychology based evals of LMs only started in 2022????!!!!

2

1

23

2,942

Kanishka Misra 🌊@kanishkamisra

13 Nov 2024

A bunch of amazing people and me @ Versailles!

2

26

3,897

Kanishka Misra 🌊@kanishkamisra

9 Jul 2023

Lucky to be hosted by one of my oldest friends who happens to have this view from his balcony, so blessed!! See you all at #ACL2023NLP tomorrow! ❤️❤️

1

24

3,049

Kanishka Misra 🌊@kanishkamisra

21 Mar 2024

☀️minicons 🌖 now supports sequence scoring with Vision-Language Models!! Looking forward to see how ppl use it (if at all!) -- feedback always welcome! 🐧🐦

code demonstrating how BLIP2-OPT-2.7B can be used queried to see if a penguin vs. a cardinal is more likely to fly, given their respective images.

import torch

from minicons import scorer
from PIL import Image

penguin = Image.open('penguin.jpg')
cardinal = Image.open('cardinal.jpg')

lm = scorer.VLMScorer(
"Salesforce/blip2-opt-2.7b",
device="cuda:0"
)

lm.sequence_score(
text_batch=["This bird can fly."] * 2,
image_batch=[penguin, cardinal]
)

#> logprobs of penguin vs cardinal -> can fly
#> [-5.644123077392578, -5.129026889801025]

ALT code demonstrating how BLIP2-OPT-2.7B can be used queried to see if a penguin vs. a cardinal is more likely to fly, given their respective images. import torch from minicons import scorer from PIL import Image penguin = Image.open('penguin.jpg') cardinal = Image.open('cardinal.jpg') lm = scorer.VLMScorer( "Salesforce/blip2-opt-2.7b", device="cuda:0" ) lm.sequence_score( text_batch=["This bird can fly."] * 2, image_batch=[penguin, cardinal] ) #> logprobs of penguin vs cardinal -> can fly #> [-5.644123077392578, -5.129026889801025]

4

2

24

2,339

Kanishka Misra 🌊@kanishkamisra

2 Jan 2025

Extremely heartbreaking :( Felix’ work has had a deep impact on me and I had hoped to meet and talk to him some day — my heartfelt condolences to his loved ones.. Please check in on your friends :(

Douwe Kiela

@douwekiela

2 Jan 2025

I’m really sad that my dear friend @FelixHill84 is no longer with us. He had many friends and colleagues all over the world - to try to ensure we reach them, his family have asked to share this webpage for the celebration of his life: pp.events/felix

1

1

24

2,231

Kanishka Misra 🌊@kanishkamisra

24 May 2025

If em dashes are distinctive of LM-generated writing then most of my papers are LM-generated :( not cool

2

1

24

2,342

Kanishka Misra 🌊@kanishkamisra

25 Nov 2024

There's a known bug in how we compute "word" probabilities with subword-based LMs that mark beginnings of words -- as pointed out by @byungdoh (w/ Will Schuler), & @tpimentelms and @clara__meister I'm pleased to announce that minicons now includes a fix which runs batch-wise!

Code:

from minicons import scorer

lm = scorer.IncrementalLMScorer("gpt2-xl", "cuda:0")

stimuli = ["I was a matron in France", "I was a mat in France"]

# old way, no correction
# P.S. gpt2 does not automatically add a bos token at the beginning...
lm.token_score(stimuli, bos_token=True, surprisal=True, base_two=True, bow_correction=False)

'''Rounded Output
[[('<|endoftext|>', 0.0),
('I', 5.85),
('was', 4.28),
('a', 4.67),
('mat', 16.34),
('ron', 1.74),
('in', 2.12),
('France', 11.43)],
[('<|endoftext|>', 0.0),
('I', 5.85),
('was', 4.28),
('a', 4.67),
('mat', 16.34),
('in', 10.78),
('France', 10.71)]]
'''

# the new way! notice the surprisal of "mat" in both cases
lm.token_score(stimuli, bos_token=True, surprisal=True, base_two=True, bow_correction=True)

'''Rounded Output
[[('<|endoftext|>', 0.0),
('I', 6.30),
('was', 3.84),
('a', 4.68),
('mat', 16.34),
('ron', 2.11),
('in', 1.75),
('France', 11.42)],
[('<|endoftext|>', 0.0),
('I', 6.30),

ALT Code: from minicons import scorer lm = scorer.IncrementalLMScorer("gpt2-xl", "cuda:0") stimuli = ["I was a matron in France", "I was a mat in France"] # old way, no correction # P.S. gpt2 does not automatically add a bos token at the beginning... lm.token_score(stimuli, bos_token=True, surprisal=True, base_two=True, bow_correction=False) '''Rounded Output [[('<|endoftext|>', 0.0), ('I', 5.85), ('was', 4.28), ('a', 4.67), ('mat', 16.34), ('ron', 1.74), ('in', 2.12), ('France', 11.43)], [('<|endoftext|>', 0.0), ('I', 5.85), ('was', 4.28), ('a', 4.67), ('mat', 16.34), ('in', 10.78), ('France', 10.71)]] ''' # the new way! notice the surprisal of "mat" in both cases lm.token_score(stimuli, bos_token=True, surprisal=True, base_two=True, bow_correction=True) '''Rounded Output [[('<|endoftext|>', 0.0), ('I', 6.30), ('was', 3.84), ('a', 4.68), ('mat', 16.34), ('ron', 2.11), ('in', 1.75), ('France', 11.42)], [('<|endoftext|>', 0.0), ('I', 6.30),

Screenshot from Oh and Schuler showing surprisal values for the partial sentences "I was a matron in" and "I was a mat in" using GPT-2 XL with leading whitespaces and trailing whitespaces.

ALT Screenshot from Oh and Schuler showing surprisal values for the partial sentences "I was a matron in" and "I was a mat in" using GPT-2 XL with leading whitespaces and trailing whitespaces.

2

3

24

2,128

Kanishka Misra 🌊@kanishkamisra

21 Jul 2025

"Seeing" robins and sparrows may not necessarily make them birdier to LMs! Super excited about this paper -- massive shoutout to all my co-authors, especially @yulu_qin and @dhevarghese for leading the charge!

Yulu Qin @yulu_qin

21 Jul 2025

Does vision training change how language is represented and used in meaningful ways?🤔 The answer is a nuanced yes! Comparing VLM-LM minimal pairs, we find that while the taxonomic organization of the lexicon is similar, VLMs are better at _deploying_ this knowledge. [1/9]

4

22

1,348

Kanishka Misra 🌊@kanishkamisra

25 Jan 2021

"Definitely Accidental" art @accidental__aRt

1

18

Kanishka Misra 🌊@kanishkamisra

26 Jul 2021

The graded status of concepts is central to cogsci research -- some items (robins) are more typical members of their category (birds) than are others (penguins). Is this something that is picked up by current #NLProc language models? 1/3

1

2

20

Kanishka Misra 🌊@kanishkamisra

24 Dec 2023

give me logprobs, at least for the inputs tyvm

Sam Altman

@sama

23 Dec 2023

what would you like openai to build/fix in 2024?

1

21

2,741

Kanishka Misra 🌊@kanishkamisra

8 Jul 2023

Happy to talk about CogSci based eval of nlp systems (perils/non-perils), conceptual meaning understanding, wugs, daxes, and your own research at #ACL2023NLP!

Kanishka Misra 🌊@kanishkamisra

8 Jul 2023

Believe it or not, #ACL2023NLP will be my first in-person ACL! Looking forward to meeting old friends and new! 🤩

1

19

2,028

Kanishka Misra 🌊@kanishkamisra

27 Dec 2021

A welcome change to the CogSci 2022 submission policy! Thanks @cogsci_soc!

A screenshot of the cogsci 2022 paper submission policy, with "unlimited pages for references" underlined

ALT A screenshot of the cogsci 2022 paper submission policy, with "unlimited pages for references" underlined

1

2

18

Kanishka Misra 🌊@kanishkamisra

31 Dec 2018

2019 #rstats goals: 1. Learn more rcpp! 2. Help improve NLP tools in R (easy access to word and sentence representations would be a start) 3. Start an R seminar at school

1

15

Kanishka Misra 🌊@kanishkamisra

7 Oct 2024

In case people recovering from colm want more lm content:

TTIC @TTIC_Connect

7 Oct 2024

October 11th at 12:30pm CT: Research@TTIC presents Kanishka Misra (@kanishkamisra) with a talk titled "Controlled Rearing of Language Models can Reveal Linguistic Insight." Please join us in Room 530, or stream via Panopto: buff.ly/3NeZvQS

2

19

2,346

Kanishka Misra 🌊@kanishkamisra

22 Nov 2024

So ARR doesn't want any of us to have a weekend... Got it! 4 days (2 of which = weekend) for rebuttals is a bit too crazy, friends!

2

20

1,760

Kanishka Misra 🌊@kanishkamisra

18 Oct 2024

If the EMNLP presentation format is either oral OR poster, why do the authors of papers that got an oral slot have to make posters? Where will these posters go?

screenshot from emnlp 2024 website. Salient info: all papers accepted to the main conf have to submit a pre-recorded video, a poster pdf, as well as slides.

ALT screenshot from emnlp 2024 website. Salient info: all papers accepted to the main conf have to submit a pre-recorded video, a poster pdf, as well as slides.

1

1

19

13,155

Kanishka Misra 🌊@kanishkamisra

8 Feb 2021

Replying to @deliprao

The Big Book of Concepts by Greg Murphy (thanks to @LakeBrenden who used it in his course, which is how I stumbled upon the book)

1

1

18

Kanishka Misra 🌊@kanishkamisra

20 Jun 2024

A big todo for minicons! 👀

Tiago Pimentel

@tpimentelms

20 Jun 2024

Hey #NLProc and #psycholing Twitter :) We found a bug in how we're all computing contextual word probabilities and wrote a paper about it! It's a very easy fix, so please check it out! +@clara__meister

2

18

1,700

Kanishka Misra 🌊@kanishkamisra

6 Sep 2018

Another analysis inspired by @kearneymw 's really quick work on the authorship of the nyt op-ed. Possible improvements: weigh function words more - works wonders in stylometry sometimes, get better data(than tweets) Code: github.com/kanishkamisra/ins…

2

5

17

Kanishka Misra 🌊@kanishkamisra

8 Nov 2023

Go work with the amazing @kmahowald, @jessyjli, and Katrin! Bonus: also get a chance to interact with @gregd_nlp and @eunsolc and their groups!

Kyle Mahowald @kmahowald

8 Nov 2023

I (+ others) are accepting grad students here in Austin! We have an exceptionally vibrant comp ling community across linguistics and CS here at UT nlp.utexas.edu/. I’m based in linguistics but happy to work across departments. Topics I can or hope to be able to fund:

2

16

2,582

Kanishka Misra 🌊@kanishkamisra

9 May 2018

Week 6 #TidyTuesday submission, this one shows the spread of Starbucks and Dunkin' Donuts coffee chains in the top 10 states in USA by coffee shops. I STREAMED while coding this so if you are interested in watching that, check out twitch.tv/iamasharkskin #rstats #dataviz

2

2

13

Kanishka Misra 🌊@kanishkamisra

5 Aug 2025

can someone pls remove the "em-dashes" feature from all LLMs??? I refuse to give up on 'em

17

1,668

Kanishka Misra 🌊@kanishkamisra

15 Jun 2024

Reducing naacl fomo by staring at Lake Michigan

lake michigan

ALT lake michigan

19

742

Kanishka Misra 🌊@kanishkamisra

4 Sep 2024

Replying to @yoavartzi

colm-base, colm-small, colm-medium, colm-large, … 🚪🚶‍♂️

18

519

Kanishka Misra 🌊@kanishkamisra

1 Nov 2023

2-page abstract (and poster) now available on kanishka.website/publication… Also check out the rest of the program curated by BUCLD's hard-working committee (of students, iirc!): bu.edu/bucld/schedule/ Glad that @najoungkim introduced this conf to me!

Abstraction via exemplars? A representational case study on lexical category inference in BERT |...

Work on understanding the representational dynamics underlying category-abstraction in LMs such as BERT.

kanishka.website

Kanishka Misra 🌊@kanishkamisra

1 Nov 2023

Excited to travel to Boston and present work on category abstraction in LMs w/ the wonderful @najoungkim at @TheBUCLD! Our poster presentation is on Nov 4! 🐈 🆎 Thread👇

Screenshot of the title of our BUCLD abstract: "Abstraction via exemplars? A representational case study on lexical category inference in BERT"

ALT Screenshot of the title of our BUCLD abstract: "Abstraction via exemplars? A representational case study on lexical category inference in BERT"

3

2

18

3,172

Kanishka Misra 🌊@kanishkamisra

10 Jul 2023

ACL day one = blessed! Finally met @sebschu irl, and two people came to me to say nice things about minicons! 😇

16

1,363

Kanishka Misra 🌊@kanishkamisra

16 Jun 2023

I bet the first thing I’ll ctrl f in my finished draft of the dissertation will be “langauge”

1

15

657

Kanishka Misra 🌊@kanishkamisra

30 Jul 2024

can’t be disappointed in your NeurIPS reviews if you don’t submit :thinksmart:

16

2,519

Kanishka Misra 🌊@kanishkamisra

24 Apr 2016

My timeline is full of Game of Thrones posts what's happening guys #GameofThrones

10

Kanishka Misra 🌊@kanishkamisra

16 Oct 2023

colm : collum :: lm : lum?

1

1

14

7,156

Kanishka Misra 🌊@kanishkamisra

25 Oct 2024

Such a timely piece about the role of NNs in CogSci 🤖! A bit of a shameless self promo, but also check out recent work from @najoungkim and me on generating experimental hypotheses from LMs (for datives), super relevant to discussion in Section 2.4: arxiv.org/abs/2408.05086

A systematic framework for generating novel experimental...

Neural language models (LMs) have been shown to capture complex linguistic patterns, yet their utility in understanding human language and more broadly, human cognition, remains debated. While...

Eva Portelance @EvaPortelance

25 Oct 2024

New paper out 🎉 !! @linguistMasoud and I wrote a perspective on how modern LMs and AI models can be useful for studying language acquisition in kids. Check out the open source paper in Language and Linguistics Compass: dx.doi.org/10.1111/lnc3.7000… and this original thread:

1

3

17

1,808

Kanishka Misra 🌊@kanishkamisra

13 Apr 2024

All I could think about when I saw the high school paper track: Gotta start them young!

2

15

1,436

Kanishka Misra 🌊@kanishkamisra

3 Dec 2022

"I happy to shed lighter"

screenshot from an email I am writing, with microsoft's grammar editor recommending I change "more light" in the phrase "I am happy to shed more light..." to "lighter"

ALT screenshot from an email I am writing, with microsoft's grammar editor recommending I change "more light" in the phrase "I am happy to shed more light..." to "lighter"

17

Kanishka Misra 🌊@kanishkamisra

15 Oct 2023

Knew this question would come some day...

screenshot of a github issue created on github.com/kanishkamisra/minicons asking whether I named the package 'minicons' due to its relevance in the fictional transformers universe. I replied with "yes"

ALT screenshot of a github issue created on github.com/kanishkamisra/minicons asking whether I named the package 'minicons' due to its relevance in the fictional transformers universe. I replied with "yes"

2

16

2,450

Kanishka Misra 🌊@kanishkamisra

2 Apr 2024

For hepful comments and conversations we thank @adelegoldberg1 and @LAWeissweiler! Thanks also to @ChrisGPotts for his PiPPaper that inspired the keys to it all idea 🙇‍♂️ Paper: arxiv.org/abs/2403.19827

Language Models Learn Rare Phenomena from Less Rare Phenomena: The...

Language models learn rare syntactic phenomena, but the extent to which this is attributable to generalization vs. memorization is a major open question. To that end, we iteratively trained...

1

16

603

Kanishka Misra 🌊@kanishkamisra

30 Mar 2024

the colm before the storm

1

1

18

2,571

Kanishka Misra 🌊@kanishkamisra

13 Jun 2023

minicons now supports the `within word l2r' method for masked language model scoring, developed by the amazing @KaufCarina (thanks for the PR!) and @neuranna 🎉🤖

screenshot of python code that allows you to use within word l2r method developed by Carina Kauf and Anna Ivanova.

ALT screenshot of python code that allows you to use within word l2r method developed by Carina Kauf and Anna Ivanova.

1

2

15

1,241

Kanishka Misra 🌊@kanishkamisra

9 Aug 2023

lower your expectations and amazing things will happen to you!

1

2

14

2,460

Kanishka Misra 🌊@kanishkamisra

15 Sep 2020

Pleased to announce that my paper with @AllysonEttinger and Julia Rayz has been accepted for publication in Findings of #emnlp2020.

1

2

15

Kanishka Misra 🌊@kanishkamisra

8 Nov 2023

Due to @amuuueller's wise insistence, minicons now supports multiple devices and quantization! You can now quickly run log-prob based evals on huuuge LMs!

screenshot demonstrating how one can use quantization using minicons, and also run stuff on multiple gpus. Here's the code:

# run `pip install bitsandbytes` first
import torch
from transformers import BitsAndBytesConfig
from minicons import scorer

# multiple device support
lm = scorer.IncrementalLMScorer("meta-llama/Llama-2-13b-hf", device="auto")

# multiple device + quantization support
bnb_config = BitsAndBytesConfig(
load_in_4bit=True,
bnb_4bit_use_double_quant=True,
bnb_4bit_quant_type="nf4",
bnb_4bit_compute_dtype=torch.bfloat16,
)

lm = scorer.IncrementalLMScorer(
"meta-llama/Llama-2-13b-hf", device="auto", quantization_config=bnb_config
)

# takes ~ 8gb on a A40 GPU

ALT screenshot demonstrating how one can use quantization using minicons, and also run stuff on multiple gpus. Here's the code: # run `pip install bitsandbytes` first import torch from transformers import BitsAndBytesConfig from minicons import scorer # multiple device support lm = scorer.IncrementalLMScorer("meta-llama/Llama-2-13b-hf", device="auto") # multiple device + quantization support bnb_config = BitsAndBytesConfig( load_in_4bit=True, bnb_4bit_use_double_quant=True, bnb_4bit_quant_type="nf4", bnb_4bit_compute_dtype=torch.bfloat16, ) lm = scorer.IncrementalLMScorer( "meta-llama/Llama-2-13b-hf", device="auto", quantization_config=bnb_config ) # takes ~ 8gb on a A40 GPU

1

14

1,154

Kanishka Misra 🌊@kanishkamisra

6 Aug 2025

Why does neurips hide the reviewers' updated scores (or lack thereof)? "this is to maximize misery for everyone" - aptly described by @najoungkim

16

2,095

Kanishka Misra 🌊@kanishkamisra

11 Apr 2024

Replying to @ChrisGPotts @tallinzen

If you look deep enough, I think that you might find that understanding means: tensor([[-0.0056, -0.0124, -0.0363, ..., 0.0041, 0.0111, 0.0141]])

16

351

Kanishka Misra 🌊@kanishkamisra

2 Jun 2022

@najoungkim and I have decided to curate a (growing) list of papers that that focus on the integration and and evaluation of novel words in NLP systems: github.com/kanishkamisra/wug… please feel free to add more papers/suggestions on organizing this list!

GitHub - kanishkamisra/wugs-and-daxes: Collection of academic works in natural language processing,...

Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study the problem of novel words in models/agents. - kanishkamisra/w...

2

5

15

Kanishka Misra 🌊@kanishkamisra

2 Apr 2025

another day another minicons update (potentially a significant one for psycholinguists?) "Word" scoring is now a thing! You just have to supply your own splitting function! pip install -U minicons for merriment

from minicons import scorer
from nltk.tokenize import TweetTokenizer

lm = scorer.IncrementalLMScorer("gpt2")

# your own tokenizer function that returns a list of words
# given some sentence input
word_tokenizer = TweetTokenizer().tokenize

# word scoring
lm.word_score_tokenized(
["I was a matron in France", "I was a mat in France"],
bos_token=True, # needed for GPT-2/Pythia and NOT needed for others
tokenize_function=word_tokenizer,
bow_correction=True, # Oh and Schuler correction
surprisal=True,
base_two=True
)

'''
First word = -log_2 P(word | <beginning of text>)

[[('I', 6.1522440910339355),
('was', 4.033324718475342),
('a', 4.879510402679443),
('matron', 17.611848831176758),
('in', 2.5804288387298584),
('France', 9.036953926086426)],
[('I', 6.1522440910339355),
('was', 4.033324718475342),
('a', 4.879510402679443),
('mat', 19.385351181030273),
('in', 6.76780366897583),
('France', 10.574726104736328)]]
'''

ALT from minicons import scorer from nltk.tokenize import TweetTokenizer lm = scorer.IncrementalLMScorer("gpt2") # your own tokenizer function that returns a list of words # given some sentence input word_tokenizer = TweetTokenizer().tokenize # word scoring lm.word_score_tokenized( ["I was a matron in France", "I was a mat in France"], bos_token=True, # needed for GPT-2/Pythia and NOT needed for others tokenize_function=word_tokenizer, bow_correction=True, # Oh and Schuler correction surprisal=True, base_two=True ) ''' First word = -log_2 P(word | <beginning of text>) [[('I', 6.1522440910339355), ('was', 4.033324718475342), ('a', 4.879510402679443), ('matron', 17.611848831176758), ('in', 2.5804288387298584), ('France', 9.036953926086426)], [('I', 6.1522440910339355), ('was', 4.033324718475342), ('a', 4.879510402679443), ('mat', 19.385351181030273), ('in', 6.76780366897583), ('France', 10.574726104736328)]] '''

1

1

16

591

Kanishka Misra 🌊@kanishkamisra

15 Jul 2024

Replying to @_jennhu

Congrats to both J HUs!

1

15

1,096

Kanishka Misra 🌊@kanishkamisra

27 Feb 2024

I love how the ao-childes validation set ends:

screenshot of the last 3 lines in the dev set of aochildes saying:

aha .
is the computer a type of pencil ?
we're all pencils .

ALT screenshot of the last 3 lines in the dev set of aochildes saying: aha . is the computer a type of pencil ? we're all pencils .

1

13

1,479

Kanishka Misra 🌊@kanishkamisra

24 Oct 2022

Replying to @tallinzen

Some papers that immediately come to mind for CogSci -> NLP: - arxiv.org/abs/2008.01766 - pnas.org/doi/full/10.1073/pn… - all responses to @Joe_Pater’s call for DL x Ling (I guess these would go both ways)

1

1

15

Kanishka Misra 🌊@kanishkamisra

23 Jan 2025

have good academic news but you'd have to go to site with blue colored sky to check it out, sorry

15

583

Kanishka Misra 🌊@kanishkamisra

2 May 2023

Come chat with me about robins, penguins, wugs, daxes, and a sprinkle of inverse scaling in LMs at the virtual poster session #3 on gather town! #eacl2023

very contentful poster about COMPS. Paper: https://arxiv.org/abs/2210.01963

ALT very contentful poster about COMPS. Paper: https://arxiv.org/abs/2210.01963

Kanishka Misra 🌊@kanishkamisra

30 Apr 2023

Presenting COMPS remotely at #EACL this week! Join me at the Virtual Poster Session #3 (May 2 @ 2:15 pm CEST) or come to my talk in the interpretability session (May 3 @ 9 am CEST)

1

15

1,465

Kanishka Misra 🌊@kanishkamisra

16 Nov 2024

Woohoo go tinlab! Congrats @HayleyRossLing @TeaAnd_OrCoffee @najoungkim!!

GenBench @GenBench

16 Nov 2024

Replying to @GenBench

Best paper!

1

16

1,271

Kanishka Misra 🌊@kanishkamisra

25 Jun 2023

soap: sick of ai papers

1

1

13

4,107

Kanishka Misra 🌊@kanishkamisra

9 Jul 2023

If people are just now trying to reach their hotel in Toronto and crying because of the traffic, it’s because of a Beyoncé concert. ur welcome!

2

1

14

2,017

Kanishka Misra 🌊@kanishkamisra

16 May 2024

statistics = only fun when it works

1

14

856