Desh Raj · Jan 12, 2026 · 4:27 PM UTC

Desh Raj

Pinned Tweet

Desh Raj

@rdesh26

Jan 12

I’m happy to share that I’m starting a new position as Senior Research Scientist at @nvidia! Looking forward to open science for speech full-duplex models :)

Desh Raj

@rdesh26

Jan 11

After 2 wonderful years, I left Meta this week. During this time, I worked on several projects related to speech and LLMs: - Built the first multi-channel audio foundation model with M-BEST-RQ (arxiv.org/abs/2409.11494) - Made ASR with SpeechLLMs faster (arxiv.org/abs/2409.08148) and more accurate (ieeexplore.ieee.org/document…) - Shipped the first production-ready full-duplex voice assistant (about.fb.com/news/2025/04/in…) - Improved Moshi’s reasoning capability with chain-of-thought (arxiv.org/abs/2510.07497) I am grateful to my managers for having my back on critical projects, and fortunate to have collaborated with several brilliant researchers and engineers during this time. As to what's next, I am still in NYC and continuing to do speech research. More on that later!

523

32,035

Desh Raj · Apr 3, 2024 · 2:03 PM UTC

Desh Raj

@rdesh26

3 Apr 2024

H1B lottery ❌ It was less than a 1 in 3 chance, but sucks anyway!

106

910

1,732,078

Desh Raj · Apr 28, 2022 · 12:06 AM UTC

Desh Raj

@rdesh26

28 Apr 2022

Any #SpeechProc person who hasn't been living under a rock for the last few years has definitely come across the "transducer" model (formerly known as the RNN-Transducer). They look like this: (1/n)

249

Desh Raj · Apr 3, 2024 · 10:48 PM UTC

Desh Raj

@rdesh26

3 Apr 2024

I feel like everyone that followed me after the H1B post will be supremely disappointed when I resume posting long threads about neural transducers 👀

226

54,908

Desh Raj · Apr 4, 2024 · 4:14 PM UTC

Desh Raj

@rdesh26

4 Apr 2024

Thanks all for your support and suggestions. Fortunately, I still have other options that I'll be pursuing with my employer. I have restricted replies to this post now so I can get back to doing research in peace :)

212

49,757

Desh Raj · May 22, 2024 · 3:47 PM UTC

Desh Raj

@rdesh26

22 May 2024

5 years, 4 months, and 26 days. Thank you, @JohnsHopkins!

211

10,924

Desh Raj · Oct 19, 2022 · 8:07 PM UTC

Desh Raj

@rdesh26

19 Oct 2022

Replying to @the_transit_guy

I am quite fond of taking the MARC train from Baltimore to DC for $9.

175

Desh Raj · Jan 26, 2024 · 6:58 PM UTC

Desh Raj

@rdesh26

26 Jan 2024

Thank you @jhuclsp and @JohnsHopkins for a memorable 5.5 years of my life! Excited to pursue new challenges at @AIatMeta 🎉

JHU CLSP @jhuclsp

26 Jan 2024

Congratulations to @rdesh26 (and adviser Sanjeev Khudanpur) on successfully defending his PhD thesis: Listening to Multi-Talker Conversations: Modular and End-to-End Perspectives. Next stop? @AIatMeta desh2608.github.io

158

12,352

Desh Raj · Jan 20, 2024 · 8:01 PM UTC

Desh Raj

@rdesh26

20 Jan 2024

📢📢 **Defending my PhD in a week** Date & Time: January 26, 2024, 9 to 11 AM EST Committee: Sanjeev Khudanpur, Dan Povey, Jinyu Li Dissertation Title: "Listening to multi-talker conversations: Modular and end-to-end perspectives" DM me for a Zoom link if interested 😀

157

14,963

Desh Raj · Feb 17, 2023 · 2:43 AM UTC

Desh Raj

@rdesh26

17 Feb 2023

ASR is speech-to-text. Today, let me tell you about "target-speaker ASR" (and our papers accepted at @ieeeICASSP). When we have several people talking, we may want to transcribe JUST one of them. E.g., to suppress background speech in noisy environments, etc. (1/n)

152

24,670

Desh Raj · Sep 15, 2023 · 6:52 PM UTC

Desh Raj

@rdesh26

15 Sep 2023

If you work on speech/NLP, you must have come across the quote: "Every time I fire a linguist, the performance of the speech recognizer goes up." This quote is attributed to Dr. Frederick Jelinek.

119

19,578

Desh Raj · Oct 5, 2022 · 3:46 PM UTC

Desh Raj

@rdesh26

5 Oct 2022

📢📢 I am thrilled to be selected as one of the inaugural AI2AI fellows for 2022-23 under the JHU+Amazon "Initiative for Interactive AI". 🎉 Eternally grateful to my advisor and collaborators! Congratulations to my fellow, um, fellows: ai2ai.engineering.jhu.edu/20…

120

Desh Raj · May 4, 2022 · 7:00 PM UTC

Desh Raj

@rdesh26

4 May 2022

I passed my GBO (qualifying exam) today and officially became a Ph.D. candidate! 🥳

117

Desh Raj · Sep 15, 2023 · 6:52 PM UTC

Desh Raj

@rdesh26

15 Sep 2023

Yesterday, the dean informed me that I have been selected as the latest recepient for the Fred Jelinek fellowship! I am extremely honored by this recognition, and I'm aware that it puts me in esteemed company. I will keep working hard to keep Jelinek's legacy alive!

112

7,365

Desh Raj · Sep 21, 2022 · 10:21 PM UTC

Desh Raj

@rdesh26

21 Sep 2022

1.5B Whisper model trained on 680k hours of speech gets 36.9% WER on AMI SDM. 34M Kaldi model trained on 100 hours of AMI train set gets 35.1%. Adaptation for multi-talker room audio conditions is very much an open problem.

105

Desh Raj · Feb 15, 2024 · 3:28 PM UTC

Desh Raj

@rdesh26

15 Feb 2024

**Dissertation now available** 📜: arxiv.org/abs/2402.08932 📽️: desh2608.github.io/static/pp… ⏯️: piped.video/watch?v=iKnCUHIg… It's a 332-page tome, but I have summarized it in this thread 👇 1/n

Listening to Multi-talker Conversations: Modular and End-to-end...

Since the first speech recognition systems were built more than 30 years ago, improvement in voice technology has enabled applications such as smart assistants and automated customer support....

arxiv.org

arXiv Sound @ArxivSound

15 Feb 2024

``Listening to Multi-talker Conversations: Modular and End-to-end Perspectives,'' Desh Raj, ift.tt/iHVD9Nk

102

84,858

Desh Raj · Jun 2, 2021 · 1:39 PM UTC

Desh Raj

@rdesh26

2 Jun 2021

After getting straight rejects for the last 2 years, I finally received some love from @INTERSPEECH2021. Congratulations and loads of gratitude to my co-authors! Can we fly to Brno already?! ✨

Desh Raj · Jan 26, 2022 · 6:55 PM UTC

Desh Raj

@rdesh26

26 Jan 2022

It's my annual reminder to myself that I got 12/12 rejections the first time I applied for PhD. You only need 1 person to believe in you! (For me that person was Dan Povey)

Eugene Vinitsky 🦋@EugeneVinitsky

26 Jan 2022

As PhD rejections start, just to normalize your expectations I went: 0 / 13 my first time applying 3 / 11 the second time 1 / 2 after dropping out of a physics PhD and switching to controls and honestly my life is so much better for those rejections

Desh Raj · Jul 27, 2020 · 11:12 PM UTC

Desh Raj

@rdesh26

27 Jul 2020

The first time I applied to PhD programs (in my senior year of undergrad), I got rejected everywhere. On Saturday, I got 2/2 papers rejected at Interspeech.

Gautam Kamath @thegautamkamath

27 Jul 2020

Academics: It happens to all of us, but we generally only project our triumphs and victories -- share a time you failed or got rejected. #AcademicChatter #AcademicTwitter

Desh Raj · Apr 27, 2022 · 5:12 PM UTC

Desh Raj

@rdesh26

27 Apr 2022

Replying to @ash1eyruba @ashleyruba

That's a fair point, but the same argument could be made for doing a PhD vs. getting a job straight out of undergrad. 5-6 years x ~50k per year = ~300k USD. Is it worth it? Depends on what you get out of the PhD (and not just what job you land after).

Desh Raj · Sep 13, 2019 · 2:39 PM UTC

Desh Raj

@rdesh26

13 Sep 2019

First paper accepted as a PhD student :) @asru2019 "Probing the information encoded in x-vectors"

Desh Raj · Nov 3, 2020 · 1:46 AM UTC

Desh Raj

@rdesh26

3 Nov 2020

📢 4 papers (3 first author, 1 other) accepted at IEEE SLT 2021 😄 Thanks to all my wonderful collaborators and reviewers! Will be putting up the papers (and code) on ArXiv in the next few days. If you are interested in diarization, separation, or ASR, do take a look :)

Desh Raj · Aug 29, 2023 · 12:33 AM UTC

Desh Raj

@rdesh26

29 Aug 2023

If you watch this space, you already know my love for the neural transducer. I skimmed through all 21 papers relating to transducers that were presented at #INTERSPEECH2023, and wrote a summary blog: desh2608.github.io/2023-08-2… Summary in 5 bullets:

Transducers at InterSpeech 2023

Neural transducers are the most popular ASR modeling paradigm in both academia and industry. Since I could not attend InterSpeech 2023 in person, I decided to sift through the archive and find all...

desh2608.github.io

4,386

Desh Raj · Jun 21, 2023 · 8:31 AM UTC

Desh Raj

@rdesh26

21 Jun 2023

🥁 New pre-print 🥁 "SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition" abs: arxiv.org/abs/2306.10559 website*: sites.google.com/view/surt2 *includes recipes and pre-trained models A ~short~ thread 👇 1/

SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition

The Streaming Unmixing and Recognition Transducer (SURT) model was proposed recently as an end-to-end approach for continuous, streaming, multi-talker speech recognition (ASR). Despite impressive...

arxiv.org

12,777

Desh Raj · Dec 1, 2021 · 5:24 PM UTC

Desh Raj

@rdesh26

1 Dec 2021

📢 I will spend summer '22 interning with the AI Speech team at Meta (formerly Facebook), in Menlo Park, California 🌞

Desh Raj · Aug 4, 2023 · 8:57 AM UTC

Desh Raj

@rdesh26

4 Aug 2023

10 years ago, WFST-based methods were the norm for speech processing (think, Kaldi). Since then, end-to-end models have become quite the rage --- they are simple, do not require much domain expertise, and you can train a PyTorch model for a new task over a weekend. ⚡️ 1/n

10,012

Desh Raj · Mar 31, 2023 · 3:25 PM UTC

Desh Raj

@rdesh26

31 Mar 2023

📢 Our tutorial on "Training Efficient Transducers with Large Data using Open-source Tools" has been accepted at InterSpeech 2023 @ISCAInterspeech: interspeech2023.org/tutorial… Time for a short 🧵 1/

7,602

Desh Raj · Jan 2, 2024 · 5:40 AM UTC

Desh Raj

@rdesh26

2 Jan 2024

In 2024, I want to: - wrap up my PhD - finish reading the Wheel of Time - climb v5 bouldering problems - speak more French

13,660

Desh Raj · Nov 11, 2023 · 4:22 PM UTC

Desh Raj

@rdesh26

11 Nov 2023

Now that @WavLab has created an open-science alternative to Whisper (called OWSM), I hope researchers building/analyzing whisper-based systems switch to OWSM instead!

7,531

Desh Raj · Jun 9, 2023 · 10:09 AM UTC

Desh Raj

@rdesh26

9 Jun 2023

Busy morning at @ieeeICASSP today presenting posters on SSL for multi-talker ASR, and my thesis work for the Rising Stars session! Thanks for showing up and asking great questions :)

2,818

Desh Raj · May 13, 2021 · 9:32 PM UTC

Desh Raj

@rdesh26

13 May 2021

📢 Summer update 📢 I will be interning with the awesome #speech people at Microsoft. I'll work on cool transducer-based streaming models for multi-speaker ASR. P.S.: Please send me your fav neural transducer paper recommendations 😀

Desh Raj · Aug 11, 2023 · 12:06 PM UTC

Desh Raj

@rdesh26

11 Aug 2023

10 days to go for our #interspeech2023 tutorial on next-gen Kaldi!

3,426

Desh Raj · Oct 30, 2020 · 10:39 PM UTC

Desh Raj

@rdesh26

30 Oct 2020

Interspeech 2020 ended yesterday. I made a blog post listing the papers I found interesting (mostly about ASR, diarization, and a bit of target speaker extraction): desh2608.github.io/2020-10-3… 1/n #interspeech2020

Desh's curated list of ASR, diarization, and related papers from Interspeech 2020

Interspeech 2020 just ended, and here is my curated list of papers that I found interesting from the proceedings. Disclaimer: This list is based on my research interests at present: ASR, speaker...

desh2608.github.io

Desh Raj · Sep 3, 2023 · 11:02 AM UTC

Desh Raj

@rdesh26

3 Sep 2023

My friend Vipul invited me on his podcast to to talk about speech, PhD, and more!

Vipul Vaibhaw

@vaibhaw_vipul

3 Sep 2023

🎙️ Ep 12 - Dive into the world of Speech Recognition with @rdesh26 #TheDistributedFabricPod We're delving into: - Automatic Speech Recognition 🗣️ - Self-Supervised Learning 🧠 - Navigating life as a PhD student 📚 - And much more awesomeness! 🎧 piped.video/watch?v=_uUj3BNO…

7,644

Desh Raj · Feb 13, 2023 · 9:27 AM UTC

Desh Raj

@rdesh26

13 Feb 2023

🚿 thoughts: Training an NN is like training in the gym. Initial gains are high and then slowly plateau; auxiliary objectives such as diets are useful; you can converge faster if you start with a pre-trained body; different architectures have different scaling laws (genetics).

7,421

Desh Raj · Apr 3, 2024 · 3:21 PM UTC

Desh Raj

@rdesh26

3 Apr 2024

Replying to @abhish_eksharma

Possibly

59,571

Desh Raj · Aug 16, 2023 · 6:59 PM UTC

Desh Raj

@rdesh26

16 Aug 2023

Thanks @ISCAInterspeech for the acknowledgment :)

3,574

Desh Raj · Mar 10, 2023 · 2:30 PM UTC

Desh Raj

@rdesh26

10 Mar 2023

So much work has happened in E2E ASR in the last decade. Will spend my weekend with these awesome review papers: 1. arxiv.org/abs/2111.01690 by Jinyu Li 2. arxiv.org/abs/2303.03329 by Rohit Prabhavalkar, Takaaki Hori, Tara Sainath, Ralf Schluter & @shinjiw_at_cmu

5,504

Desh Raj · Dec 13, 2022 · 6:38 AM UTC

Desh Raj

@rdesh26

13 Dec 2022

F1 visa renewal process. Time spent in: - filling application: ~ 1 hour - trying to book interview slot: > 2 weeks - standing in line at embassy: ~ 2 hours - interviewing: < 1 minute

Desh Raj · Dec 18, 2023 · 12:36 PM UTC

Desh Raj

@rdesh26

18 Dec 2023

I skipped ASRU to attend my friend's wedding. Great decision!

9,994

Desh Raj · Dec 13, 2022 · 4:58 AM UTC

Desh Raj

@rdesh26

13 Dec 2022

📢📢 New preprint just dropped 📢📢 "GPU-accelerated guided source separation for meeting transcription" Paper: arxiv.org/abs/2212.05271 Code: github.com/desh2608/gss 1/n

GPU-accelerated Guided Source Separation for Meeting Transcription

Guided source separation (GSS) is a type of target-speaker extraction method that relies on pre-computed speaker activities and blind source separation to perform front-end enhancement of...

arxiv.org

Desh Raj · Oct 28, 2019 · 5:52 PM UTC

Desh Raj

@rdesh26

28 Oct 2019

Replying to @math_rachel

I don't know much about images, but anyone who thinks speech is a solved problem is welcome to participate in the upcoming Chime-6 challenge :-)

Desh Raj · Jan 8, 2020 · 11:12 AM UTC

Desh Raj

@rdesh26

8 Jan 2020

Since 2018, there has been immense interest in using Transformers for ASR. In my new blog post, I look at the various challenges and the solutions people have proposed. #SpeechProc desh2608.github.io/2020-01-0…

The Challenges of using Transformers in ASR

Since mid 2018 and throughout 2019, one of the most important directions of research in speech recognition has been the use of self-attention networks and transformers, as evident from the numerous...

desh2608.github.io

Desh Raj · Jun 14, 2024 · 2:23 PM UTC

Desh Raj

@rdesh26

14 Jun 2024

"Everyone wants to do the model work ⚙️, not the data work 🗃️." Throughout my PhD, I mostly published modeling papers. These are the papers that identify a problem, propose a modeling solution, and show results on standard benchmarks. 1/n

8,748

Desh Raj · Oct 27, 2023 · 3:35 PM UTC

Desh Raj

@rdesh26

27 Oct 2023

I am convinced that US immigrant brains are 80% useful stuff and 20% random visa-related information.

5,584

Desh Raj · Jun 16, 2021 · 2:43 PM UTC

Desh Raj

@rdesh26

16 Jun 2021

I attended @ieeeICASSP last week, and here are my 3 main take-aways: desh2608.github.io/2021-06-1… 1. Self-training and contrastive learning are here to stay 2. Transducer models + T-S learning = streaming ASR 3. Speaker diarization is wide-open (clustering, EEND, separation ...)

My 3 takeaways from IEEE ICASSP 2021

I attended the virtual ICASSP 2021, and this is a short post with my 3 key take-aways from the conference. As with my previous conference summary posts, this post is heavily biased by my research...

desh2608.github.io

Desh Raj · Feb 14, 2022 · 3:05 PM UTC

Desh Raj

@rdesh26

14 Feb 2022

It seems several groups have recently been looking at extending/generalizing ASR objectives. Baidu proposed W-CTC (openreview.net/forum?id=0RqD…) which extends CTC for training with data that contains missing labels on the ends.

Desh Raj · Jan 30, 2021 · 2:07 PM UTC

Desh Raj

@rdesh26

30 Jan 2021

Academia, as in life, sometimes brings you bittersweet days. On the same day that I gave a talk at the @jhuclsp seminar for the first time, I also got a paper rejected at #icassp2021. Nevertheless, I celebrated both with 🍷 at the end of the day :)

Desh Raj · May 4, 2023 · 5:55 PM UTC

Desh Raj

@rdesh26

4 May 2023

2) 🥳 I have been selected for ✨ICASSP Rising Stars in Signal Processing✨ Please join us on June 9 in the poster session where I will talk about my thesis work.

5,246

Desh Raj · Aug 13, 2023 · 9:17 PM UTC

Desh Raj

@rdesh26

13 Aug 2023

Ready for the poster session at #interspeech2023!

ALT Poster for InterSpeech paper.

4,214

Desh Raj · May 17, 2023 · 6:26 PM UTC

Desh Raj

@rdesh26

17 May 2023

This work has been accepted at @ISCAInterspeech 🥳

Desh Raj

@rdesh26

13 Dec 2022

📢📢 New preprint just dropped 📢📢 "GPU-accelerated guided source separation for meeting transcription" Paper: arxiv.org/abs/2212.05271 Code: github.com/desh2608/gss 1/n

3,382

Desh Raj · Dec 29, 2021 · 6:07 PM UTC

Desh Raj

@rdesh26

29 Dec 2021

After 3.5 years in the US, I have grown used to driving in mi/h and weighing stuff in lbs, but still can't wrap my head around Fahrenheit 🧐

Desh Raj · Feb 13, 2021 · 1:22 AM UTC

Desh Raj

@rdesh26

13 Feb 2021

I spent at least 1 hour yesterday looking at the beautiful snow-covered trees outside my window while I played my guitar. Gonna clock it under "software development" hours because mental software got pretty damn developed.

Desh Raj · Jun 7, 2021 · 2:09 PM UTC

Desh Raj

@rdesh26

7 Jun 2021

Tutorial on "Distant conversational speech recognition" is now underway at #ICASSP2021. Slides available here: github.com/ICASSP2021-tutori…

GitHub - ICASSP2021-tutorial9/Distant_conversational_ASR_and_analysis

Contribute to ICASSP2021-tutorial9/Distant_conversational_ASR_and_analysis development by creating an account on GitHub.

github.com

Desh Raj · Mar 25, 2023 · 1:34 PM UTC

Desh Raj

@rdesh26

25 Mar 2023

Sure, ChatGPT is cool. But is it cooler than an all-flannel @jhuclsp line-up? (ca. 2020)

ALT People wearing flannel shirts and trying to look cool.

4,125

Desh Raj · Mar 25, 2021 · 10:29 PM UTC

Desh Raj

@rdesh26

25 Mar 2021

What @INTERSPEECH2021 says: Submission deadline is Mar 26; papers can be updated by Apr 2. What I read: Submission deadline is Apr 2; must make a dummy submission by Mar 26.

Desh Raj · Dec 18, 2019 · 5:10 AM UTC

Desh Raj

@rdesh26

18 Dec 2019

Wins Best Paper award @asru2019 Congratulations! @jhuclsp

Desh Raj

@rdesh26

12 Dec 2019

Replying to @rdesh26

MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe ADV: ASR in Adverse Environments Sunday, 15 December, 16:00 - 17:30 arxiv.org/abs/1910.06522

Desh Raj · Mar 3, 2024 · 4:17 PM UTC

Desh Raj

@rdesh26

3 Mar 2024

It's a beautiful day to do some @icmlconf reviews 📝

7,180

Desh Raj · Nov 7, 2023 · 1:30 AM UTC

Desh Raj

@rdesh26

7 Nov 2023

Presenting at the NSF CIRC meeting today

2,615

Desh Raj · Jul 6, 2024 · 9:15 PM UTC

Desh Raj

@rdesh26

6 Jul 2024

Meeting up with Boston #speech folks over BBQ and beer 🍻 (minus @JonathanLeRoux)

5,041

Desh Raj · Oct 27, 2022 · 12:45 PM UTC

Desh Raj

@rdesh26

27 Oct 2022

Co-author: We'll need to remove this section. There's no space. Me (a 5th year PhD student): There's always space. #icassp

Desh Raj · Oct 16, 2021 · 2:29 AM UTC

Desh Raj

@rdesh26

16 Oct 2021

Replying to @zacharynado

Desh Raj · Jun 22, 2023 · 8:01 PM UTC

Desh Raj

@rdesh26

22 Jun 2023

The JSALT summer school wraps up tomorrow. On Monday, the workshop starts. Looking forward to 6 weeks of WFSTs + speech! Pic from: jsalt2023.univ-lemans.fr/en/…

3,315

Desh Raj · Jan 22, 2022 · 3:00 AM UTC

Desh Raj

@rdesh26

22 Jan 2022

This is now accepted for publication at @ieeeICASSP 2022 🎉

Desh Raj

@rdesh26

12 Oct 2021

Replying to @rdesh26

How do you create a hybrid ASR system for a new language X with only 15 mins of transcribed speech? Answer: Use XLSR-53, transcribed speech from other languages, and extra text from language X

Desh Raj · Jan 11, 2023 · 4:34 AM UTC

Desh Raj

@rdesh26

11 Jan 2023

"Vasudhaiva kutumbakam" is a Sanskrit phrase which means "the world is one family." Featured: Dinner with other young researchers at @ieee_slt.

2,066

Desh Raj · Mar 21, 2024 · 1:16 PM UTC

Desh Raj

@rdesh26

21 Mar 2024

5,762

Desh Raj · Oct 27, 2023 · 9:16 PM UTC

Desh Raj

@rdesh26

27 Oct 2023

Reviewing an ICLR paper and came across this gem: "LMs have been widely used in ASR in the last 2 decades."

9,405

Desh Raj · Sep 22, 2022 · 1:53 PM UTC

Desh Raj

@rdesh26

22 Sep 2022

A friend who works on analysis of brain signals asked me if there were some ASR techniques that they could use. I briefly explained SSL methods, but mentioned that it would require tons of data. "How much data do you have?" Friend: "Like 3 hours. That's a lot, right?" 😅

Desh Raj · Sep 27, 2023 · 11:49 AM UTC

Desh Raj

@rdesh26

27 Sep 2023

Now published at IEEE Transactions on Audio, Speech, and Language Processing (TASLP). Early access here: ieeexplore.ieee.org/document…

Desh Raj

@rdesh26

21 Jun 2023

2,733

Desh Raj · Jul 28, 2023 · 1:08 PM UTC

Desh Raj

@rdesh26

28 Jul 2023

Hermann Ney's talk at JSALT was intense and enlightening! Dr. Ney been at the fore-front of data-driven ML for 45 years now, and hearing his views on the field really puts things in perspective.

2,866

Desh Raj · Nov 21, 2023 · 12:56 AM UTC

Desh Raj

@rdesh26

21 Nov 2023

Pleasantly surprised to get a complimentary NeurIPS registration for reviewing services. (Have other travel plans on those dates unfortunately, but will check out the virtual program) More conferences should do this! @ISCAInterspeech 👀

3,661

Desh Raj · Jul 22, 2020 · 1:45 PM UTC

Desh Raj

@rdesh26

22 Jul 2020

My go-to debugging technique when I'm stuck on an issue working late night is to stop working and go to bed. During the night, programming angels drop in and fix the bug, so I wake up to a working code.

Desh Raj · Jan 14, 2023 · 3:47 PM UTC

Desh Raj

@rdesh26

14 Jan 2023

The last time I went to a #SpeechProc conference in person was ASRU'19 in Singapore as a 1st year grad. This time around at @ieee_slt, as a mentor at the SLT-CODE hackathon and then a volunteer, I got to experience the conference from a very different perspective!

2,169

Desh Raj · Mar 5, 2021 · 8:04 PM UTC

Desh Raj

@rdesh26

5 Mar 2021

📢A new tool and a blog post to make diarization evaluation simple+fast. Code: github.com/desh2608/spyder Post: desh2608.github.io/2021-03-0… - Implemented in C++ for ~5x speedup over md-eval - To install: `pip install spy-der` - Use from within your Python program / command line

Desh Raj · Jun 15, 2023 · 8:34 AM UTC

Desh Raj

@rdesh26

15 Jun 2023

Next up in the JSALT summer school program this morning is a talk by @ryandcotterell.

2,854

Desh Raj · Jun 18, 2021 · 3:06 PM UTC

Desh Raj

@rdesh26

18 Jun 2021

Received some more love in the form of an ISCA Travel Grant! Thanks @INTERSPEECH2021 ✨ Now begins the struggle for a visa 😅

Desh Raj

@rdesh26

2 Jun 2021

After getting straight rejects for the last 2 years, I finally received some love from @INTERSPEECH2021. Congratulations and loads of gratitude to my co-authors! Can we fly to Brno already?! ✨

Desh Raj · Apr 3, 2024 · 3:23 PM UTC

Desh Raj

@rdesh26

3 Apr 2024

Replying to @iver56

Haha not a bad idea

45,293

Desh Raj · May 4, 2023 · 5:55 PM UTC

Desh Raj

@rdesh26

4 May 2023

📢 Now that my Schengen visa is approved, time for some quick updates and summer plans!

5,011

Desh Raj · Jan 22, 2022 · 5:51 AM UTC

Desh Raj

@rdesh26

22 Jan 2022

My internship work from last summer is now accepted for publication at @ieeeICASSP 😄 Immense gratitude to my collaborators Liang, Zhuo, Yashesh, and Jinyu for their guidance. pdf: arxiv.org/pdf/2109.08555.pdf abs: arxiv.org/abs/2109.08555

Desh Raj

@rdesh26

13 May 2021

Desh Raj · Jun 16, 2020 · 1:49 AM UTC

Desh Raj

@rdesh26

16 Jun 2020

Last month, our system from JHU CLSP achieved 2nd best WER in the CHiME-6 challenge (track 2: dinner party diarization + ASR). The system description paper is now available at: arxiv.org/abs/2006.07898

Desh Raj · Oct 12, 2021 · 6:05 PM UTC

Desh Raj

@rdesh26

12 Oct 2021

📢 New paper on ArXiv 📢 "Injecting text and cross-lingual supervision in few-shot learning from self-supervised models" abs: arxiv.org/abs/2110.04863 pdf: arxiv.org/pdf/2110.04863.pdf

Desh Raj · Jan 12, 2023 · 8:25 PM UTC

Desh Raj

@rdesh26

12 Jan 2023

This short collaboration with @SamueleCornell and colleagues on "separation+diarization" turned out well. Looking forward to working together on CHiME-7! chimechallenge.org/current/t…

2,841

Desh Raj · Sep 11, 2020 · 12:58 PM UTC

Desh Raj

@rdesh26

11 Sep 2020

The next generation Kaldi is under development, and you can help in crafting a roadmap for its next life cycle: kaldi.dev/

Desh Raj · Oct 12, 2021 · 6:05 PM UTC

Desh Raj

@rdesh26

12 Oct 2021

How do you create a hybrid ASR system for a new language X with only 15 mins of transcribed speech? Answer: Use XLSR-53, transcribed speech from other languages, and extra text from language X

Desh Raj · Mar 1, 2023 · 1:45 PM UTC

Desh Raj

@rdesh26

1 Mar 2023

Every time I leave my parents to come back to the US, I feel a little sad for leaving them. Then I arrive at @PatnaAirport and remember why I left in the first place. I have been to bus stations better run than this shithole.

5,262

Desh Raj · Sep 27, 2022 · 4:33 PM UTC

Desh Raj

@rdesh26

27 Sep 2022

#Lhotse now supports annotating audio files with OpenAI's #Whisper! Here's a quick demo I created in 10 mins to transcribe the first 100 recordings from VoxCeleb. See PR from @PiotrZelasko: github.com/lhotse-speech/lho… Here's the sample audio: tinyurl.com/3vjxnubk

Desh Raj · Apr 15, 2020 · 1:56 AM UTC

Desh Raj

@rdesh26

15 Apr 2020

ICASSP is free :)

IEEE Signal Processing Society @IEEEsps

14 Apr 2020

Registration for the first virtual #ICASSP2020 is now open! SPS is excited to offer complimentary registration to non-authors, sharing our cutting-edge ICASSP sessions and energizing our signal processing community around the globe. Register today! cmsworkshops.com/ICASSP2020/…

Desh Raj · Aug 26, 2020 · 11:28 AM UTC

Desh Raj

@rdesh26

26 Aug 2020

26 🥳 and newly found love for sparkling wine 😆

Desh Raj · Dec 10, 2020 · 2:13 PM UTC

Desh Raj

@rdesh26

10 Dec 2020

My Instagram feed is full of people getting married or attending weddings. My Twitter feed is full of academics arguing about every little thing under the sun. Intense contest for which account gets deactivated first.

Desh Raj · Jun 13, 2024 · 12:20 AM UTC

Desh Raj

@rdesh26

13 Jun 2024

Hey @AIatMeta, look and tell me how cool these RBM glasses are!

3,207

Desh Raj · May 4, 2023 · 5:55 PM UTC

Desh Raj

@rdesh26

4 May 2023

3) 🇫🇷 Right after ICASSP, I will spend 8 weeks at Le Mans University (France), participating in JSALT 2023! I will work on "WFST methods for modern speech processing" alongside researchers from @Google, @rev, @ButSpeech, and more. Check 👇 jsalt2023.univ-lemans.fr/en/…

1,357

Desh Raj · Jan 13, 2021 · 2:22 PM UTC

Desh Raj

@rdesh26

13 Jan 2021

I haven't reviewed a lot, but I make sure that I always provide some +ve comments about the paper, and make my -ve comments come across as constructive feedback. It is quite disheartening, then, to receive a review where it seems the reviewer is out to personally get you.

Desh Raj · Mar 18, 2021 · 11:21 AM UTC

Desh Raj

@rdesh26

18 Mar 2021

I'll be talking about some of my recent work on speaker diarization in the @iscasigml seminar on May 5 :)

ISCA-SIGML @iscasigml

17 Mar 2021

We are delighted to announce the ISCA SIGML seminar series. This seminar series focuses on speech processing, providing a place for speech researchers to present, discuss, learn, and exchange ideas. Please find the schedule in the following webpage. homepages.inf.ed.ac.uk/htang…

Desh Raj · May 1, 2023 · 3:48 PM UTC

Desh Raj

@rdesh26

1 May 2023

I will present this work in an oral session at ICASSP: Session: Multi-speaker ASR When: June 7, 2023 at 10:50 AM EEST Where: Rhodes, Greece Slides and a short video are now available: 💾: tinyurl.com/u5jj4nfp 📽️: piped.video/watch?v=L2WnjQC8…

Desh Raj

@rdesh26

17 Feb 2023

Replying to @rdesh26

This was work done with my @MetaAI colleagues Junteng, Jay, Chunyang, Niko, Xiaohui, and Ozlem, with whom I spent a really fun 14 weeks last summer. (10/n) Paper: arxiv.org/abs/2210.11588

2,981

Desh Raj · Jun 27, 2021 · 6:38 PM UTC

Desh Raj

@rdesh26

27 Jun 2021

Me learning French: cool, now I can speak the local language in Montréal! YUL check-in: Sir do you speak English or parlez-vous français? Me tongue-tied: Parle anglais 😓

Desh Raj · Dec 22, 2020 · 8:14 AM UTC

Desh Raj

@rdesh26

22 Dec 2020

I'm planning to take the next 2 weeks off, which means I'll prepare the presentation videos for SLT 2021 (due on Jan 5) and work on some implementation that I have been putting off for a while. Happy holidays to you too! #phdlife

Desh Raj · Feb 17, 2023 · 8:49 AM UTC

Desh Raj

@rdesh26

17 Feb 2023

Brother's wedding: Feb 20-26 (Indian wedding) Holi festival: March 8 InterSpeech submission: March 8 ICASSP camera-ready: March 13 Looks like an eventful next few weeks!

INTERSPEECH 2025 @ISCAInterspeech

17 Feb 2023

In response to some queries, please be aware that we will NOT be extending the paper deadline for #INTERSPEECH2023 . We have a very tight schedule for reviewing! You can make it!

4,852

Desh Raj · Feb 17, 2020 · 3:43 PM UTC

Desh Raj

@rdesh26

17 Feb 2020

Congratulations Dr. Snyder! david-ryan-snyder.github.io/

Desh Raj · Dec 6, 2022 · 3:05 PM UTC

Desh Raj

@rdesh26

6 Dec 2022

Nothing screams "India" quite like getting 8 passport photos in under $1 in 10 mins, and then spending 2 hours at the bank to get an account phone number changed.

Desh Raj · Aug 4, 2020 · 2:23 PM UTC

Desh Raj

@rdesh26

4 Aug 2020

Replying to @tallinzen @joabingel @huggingface

The flag<->language thing is also an issue for countries with huge language diversity. If they hire a Hindi-speaker, for instance, it wouldn't work to add the Indian flag to the list because 56% Indians don't speak Hindi as their first language (and 43% don't speak Hindi at all).

Desh Raj · Mar 31, 2021 · 3:06 PM UTC

Desh Raj

@rdesh26

31 Mar 2021

This doesn't give me a free pass to socialize. We must practice social distancing until each of us has been vaccinated :) But it's definitely a small win! 🎉