Arvind Neelakantan · Apr 11, 2025 · 6:03 PM UTC

Arvind Neelakantan

@arvind_io

11 Apr 2025

thrilled to be back @Google in the @GoogleDeepMind team! The technical breadth and expertise across the whole stack (hardware->infra->deep learning->products) is truly mind-blowing. Great to see a lot of familiar faces and meet new friends. Look forward to learning a lot!

1,141

97,366

Arvind Neelakantan · Sep 10, 2024 · 6:22 PM UTC

Arvind Neelakantan

@arvind_io

10 Sep 2024

Excited to join @AIatMeta! The past 4.5 years at @OpenAI, working on embeddings, GPT-3 & 4, API and ChatGPT, have been career highlights. Now, I'm thrilled to work on the next generations of Llama and contribute to its impact on the developer ecosystem and billions of users!🚀 1/2

1,133

143,141

Arvind Neelakantan · Nov 1, 2019 · 2:51 AM UTC

Arvind Neelakantan

@arvind_io

1 Nov 2019

We explore a simple approach to task-oriented dialog. A single neural network consumes conversation history and external knowledge as input and generates the next turn text response along with the action (when necessary) as output. Paper: arxiv.org/pdf/1910.14613.pdf 1/4

227

Arvind Neelakantan · May 29, 2018 · 4:57 AM UTC

Arvind Neelakantan

@arvind_io

29 May 2018

We develop a non-autoregressive machine translation model whose accuracy almost matches a strong greedy autoregressive baseline Transformer, while being 3.3 times faster at inference. Joint work with @ashVaswani @nikiparmar09 Aurko Roy arxiv.org/abs/1805.11063

Theory and Experiments on Vector Quantized Autoencoders

Deep neural networks with discrete latent variables offer the promise of better symbolic reasoning, and learning abstractions that are more useful to new tasks. There has been a surge in interest...

arxiv.org

206

Arvind Neelakantan · Jan 28, 2022 · 10:21 PM UTC

Arvind Neelakantan

@arvind_io

28 Jan 2022

A thread on how we evaluate our embedding models in OpenAI’s API. We achieve state-of-the-art results in linear probe classification, text search and code search. It’s not fine-tuned, so it works great in the real world — and our customers love it. 1/7

155

Arvind Neelakantan · Jan 27, 2023 · 10:09 PM UTC

Arvind Neelakantan

@arvind_io

27 Jan 2023

Replying to @tszzl

imagine being told you are wrong a million times a second, for a few months.

4,577

Arvind Neelakantan · Jan 31, 2022 · 9:04 PM UTC

Arvind Neelakantan

@arvind_io

31 Jan 2022

Zero-shot results of OpenAI API’s embeddings on the FIQA search dataset. Evaluation script: github.com/arvind-neural/bei… We zero-shot evaluated on 14 text search datasets; our embeddings outperform keyword search and previous dense embedding methods on 11 of them!

Arvind Neelakantan

@arvind_io

28 Jan 2022

Replying to @arvind_io

In text search tasks, we obtain best zero-shot results in msmarco, triviaQA, and NQ and also the best transfer results on the BEIR benchmark. 5/7

Arvind Neelakantan · Sep 13, 2019 · 4:19 PM UTC

Arvind Neelakantan

@arvind_io

13 Sep 2019

We are excited to release Taskmaster-1, a new task-oriented dialog dataset. We explore two methods for data collection, two-person and self-dialogs. Surprisingly, self-dialogs are an effective way to collect dialog. Paper accepted to @emnlp2019: arxiv.org/pdf/1909.05358v1.p…

Arvind Neelakantan · Sep 10, 2024 · 6:22 PM UTC

Arvind Neelakantan

@arvind_io

10 Sep 2024

look forward to working with @manohar_paluri, @Ahmad_Al_Dahle, @edunov and many others in the excellent @AIatMeta team! 2/2

20,204

Arvind Neelakantan · Feb 8, 2022 · 6:51 PM UTC

Arvind Neelakantan

@arvind_io

8 Feb 2022

Thanks for a balanced take! Couple of comments that are also added to the video description now: 1/4

Yannic Kilcher 🇸🇨

@ykilcher

8 Feb 2022

🔥New Video🔥 OpenAI now offers embeddings for text similarity and search, but are they holding up? We look at the release, the paper, the criticism, and most important: the price! Are the embeddings worth it? Watch here to find out: piped.video/5skIqoO3ku0

[Image: OpenAI Embeddings]

Arvind Neelakantan · Jan 28, 2022 · 10:21 PM UTC

Arvind Neelakantan

@arvind_io

28 Jan 2022

Small models specifically fine-tuned on a dataset can do well on a narrow benchmark, but they far underperform in real-world settings, as many of our customers are discovering. This study from @FineTuneLearn shows our API performance. 7/7

Arvind Neelakantan · Feb 1, 2022 · 11:56 PM UTC

Arvind Neelakantan

@arvind_io

1 Feb 2022

OpenAI embeddings work on a very broad set of use cases. Here, Viable gets a 7.7% absolute improvement in clustering quality using OpenAI embeddings when compared to previous methods!

Viable 🎯@askviable

1 Feb 2022

We tested different embedding models and show the data behind why GPT-3 was the clear winner for our clustering needs askviable.com/blog/why-we-ch…

Arvind Neelakantan · Feb 1, 2022 · 5:47 PM UTC

Arvind Neelakantan

@arvind_io

1 Feb 2022

The cost to run this experiment with text-search-ada, embedding both documents and queries, is ~$80. text-search-ada achieves a 62% relative improvement over keyword search here!

Arvind Neelakantan

@arvind_io

31 Jan 2022
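
The tweets above quote both an absolute improvement (7.7%) and a relative improvement (62%), which are different measures. As a minimal sketch of the difference, the following uses made-up scores chosen only for illustration; they are not taken from the paper or from any of the linked experiments, and the function names are hypothetical.

```python
def absolute_improvement(baseline, new):
    """Difference between the two scores, in the metric's own units."""
    return new - baseline


def relative_improvement(baseline, new):
    """Difference expressed as a fraction of the baseline score."""
    return (new - baseline) / baseline


# Hypothetical scores, NOT real results: a baseline of 0.40 and a new score of 0.50.
baseline_score = 0.40
new_score = 0.50

abs_gain = absolute_improvement(baseline_score, new_score)
rel_gain = relative_improvement(baseline_score, new_score)

print(f"absolute: {abs_gain * 100:.1f} points")
print(f"relative: {rel_gain * 100:.1f}%")
```

With these illustrative numbers, the same change reads as 10 points absolute but 25% relative, which is why the two kinds of figures cannot be compared directly.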

Arvind Neelakantan · Feb 29, 2024 · 7:44 PM UTC

Arvind Neelakantan

@arvind_io

29 Feb 2024

@OpenAI embeddings api over time

2,426

Arvind Neelakantan · Jun 12, 2019 · 4:25 AM UTC

Arvind Neelakantan

@arvind_io

12 Jun 2019

We describe a simple technique to parallelize Scheduled Sampling across time that allows us to apply Scheduled Sampling for problems that involve generating very long sequences. We get better sample quality and train almost as fast as teacher-forcing. arxiv.org/abs/1906.04331

Arvind Neelakantan · Feb 9, 2022 · 11:46 PM UTC

Arvind Neelakantan

@arvind_io

9 Feb 2022

Replying to @ylecun

For the same reason a kind of unsupervised learning that people were always doing was branded as self-supervised learning 😉

Arvind Neelakantan · Jan 25, 2022 · 6:50 PM UTC

Arvind Neelakantan

@arvind_io

25 Jan 2022

We've trained embedding models to produce high quality text and code embeddings. Our general purpose embeddings achieve top results in classification, text search, and code search. The models are now available in the @OpenAI API: openai.com/blog/introducing-…

Introducing text and code embeddings

We are introducing embeddings, a new endpoint in the OpenAI API that makes it easy to perform natural language and code tasks like semantic search, clustering, topic modeling, and classification.

openai.com

OpenAI

@OpenAI

25 Jan 2022

We're introducing embeddings, a new feature of our API that distills relationships between concepts, sentences, and even code in a simple numerical representation — for more powerful search, classification, and recommendations. openai.com/blog/introducing-…

Arvind Neelakantan · May 11, 2023 · 6:45 PM UTC

Arvind Neelakantan

@arvind_io

11 May 2023

@OpenAI embeddings achieve better retrieval performance and are also a lot cheaper! Results taken from: arxiv.org/pdf/2305.06300.pdf

2,612

Arvind Neelakantan · Jan 28, 2022 · 10:21 PM UTC

Arvind Neelakantan

@arvind_io

28 Jan 2022

My team and I trained the model. We look at 33 datasets across four different categories: linear probe classification, sentence similarity, text search, and code search. All these results and figures were in our paper, released this week. arxiv.org/pdf/2201.10005.pdf 2/7

Arvind Neelakantan · Jan 28, 2022 · 10:21 PM UTC

Arvind Neelakantan

@arvind_io

28 Jan 2022

In text search tasks, we obtain best zero-shot results in msmarco, triviaQA, and NQ and also the best transfer results on the BEIR benchmark. 5/7

Arvind Neelakantan · Jun 17, 2025 · 7:17 PM UTC

Arvind Neelakantan

@arvind_io

17 Jun 2025

amazing multimodality performance (& more) !! storage.googleapis.com/deepm…

4,620

Arvind Neelakantan · May 21, 2025 · 12:13 AM UTC

Arvind Neelakantan

@arvind_io

21 May 2025

TPU -> XLA -> JAX -> Transformer, MoE, Chinchilla, AlphaGo, .... -> Gemini, Veo, .... -> Search, YouTube, Waymo, ... -> Chrome, Android, .... 🤯🤯🤯

2,948

Arvind Neelakantan · Feb 23, 2022 · 6:04 PM UTC

Arvind Neelakantan

@arvind_io

23 Feb 2022

OpenAI Embeddings helps you go beyond keyword search!

Lilian Weng

@lilianweng

22 Feb 2022

Replying to @lilianweng

The code is actually extremely simple for a cool app like this - open sourced here: github.com/lilianweng/emoji-…
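
For readers unfamiliar with how embedding search goes "beyond keyword search", here is a minimal sketch of ranking documents by cosine similarity between vectors. It is not the code from the linked repository and does not call the OpenAI API; the three-dimensional vectors, document names, and function names are all hypothetical stand-ins for real embeddings.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def search(query_vec, doc_vecs, top_k=3):
    """Return the ids of the top_k documents most similar to the query."""
    scored = [(cosine_similarity(query_vec, vec), doc_id)
              for doc_id, vec in doc_vecs.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [doc_id for _, doc_id in scored[:top_k]]


# Hypothetical toy embeddings; a real system would obtain these from a model.
docs = {
    "refund policy": [0.9, 0.1, 0.0],
    "shipping times": [0.1, 0.9, 0.1],
    "gift cards": [0.0, 0.2, 0.9],
}
query = [0.8, 0.2, 0.1]  # stands in for the embedding of a user query

print(search(query, docs, top_k=2))
```

Because the ranking depends only on vector geometry, a query can match a document that shares no words with it, provided the model places the two close together.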

Arvind Neelakantan · Apr 29, 2021 · 7:25 PM UTC

Arvind Neelakantan

@arvind_io

29 Apr 2021

A nice blog post from @mavenoid on how to get reliable generations from GPT-3: blog.mavenoid.com/gpt3-and-c… Unsupervised retrieval+generation FTW!

Why GPT-3 is a big deal for customer support

AI text generation is fast approaching human-like levels, revolutionizing the way many industries do business. And it's already making customer support more efficient.

mavenoid.com

Arvind Neelakantan · Jan 28, 2022 · 10:21 PM UTC

Arvind Neelakantan

@arvind_io

28 Jan 2022

We also achieve new state-of-the-art results on code search. 6/7

Arvind Neelakantan · Dec 14, 2019 · 3:23 AM UTC

Arvind Neelakantan

@arvind_io

14 Dec 2019

Check out our spotlight talk and poster describing the Neural Assistant work in the ConvAI workshop tomorrow @NeurIPSConf #neurips19 alborz-geramifard.com/worksh…

Arvind Neelakantan

@arvind_io

1 Nov 2019

Arvind Neelakantan · Sep 10, 2024 · 8:25 PM UTC

Arvind Neelakantan

@arvind_io

10 Sep 2024

Replying to @sharan0909 @AIatMeta @OpenAI

yes!!!

2,287

Arvind Neelakantan · Jul 17, 2025 · 5:29 PM UTC

Arvind Neelakantan

@arvind_io

17 Jul 2025

Replying to @agihippo

Good but hard to not have @DBahdanau

8,223

Arvind Neelakantan · Apr 24, 2020 · 1:11 AM UTC

Arvind Neelakantan

@arvind_io

24 Apr 2020

We do a large-scale human study to compare different decoding methods for language generation and develop a globally normalized decoding method that optimally traverses the quality-diversity curve.

Daniel Duckworth @duck

24 Apr 2020

How does one trade-off sample quality and diversity in a language model? Which decoding method is best? We introduce a multi-objective framework maximizing human judgement score subject to a constraint on diversity (entropy). arxiv.org/abs/2004.10450 (1/7)

Arvind Neelakantan · Jan 28, 2022 · 10:21 PM UTC

Arvind Neelakantan

@arvind_io

28 Jan 2022

In linear probe classification, we obtain best results wrt average accuracy on seven classification tasks. 3/7

Arvind Neelakantan · Jan 28, 2022 · 10:21 PM UTC

Arvind Neelakantan

@arvind_io

28 Jan 2022

In sentence similarity tasks, we perform worse than previous work. This was explained in our paper as well. 4/7

Arvind Neelakantan · May 27, 2023 · 6:38 PM UTC

Arvind Neelakantan

@arvind_io

27 May 2023

Replying to @WilliamWangNLP

Thanks for having me, I had a fun time visiting @ucsbNLP !

600

Arvind Neelakantan · Jul 5, 2021 · 6:08 PM UTC

Arvind Neelakantan

@arvind_io

5 Jul 2021

Replying to @ilyasut

Belief is all you need!

Arvind Neelakantan · Feb 7, 2017 · 1:30 PM UTC

Arvind Neelakantan

@arvind_io

7 Feb 2017

Our paper (arxiv.org/abs/1611.08945) on neural program induction was accepted to #ICLR2017! Code: github.com/tensorflow/models… #DeepLearning #NLProc

Arvind Neelakantan · Jan 29, 2022 · 2:13 AM UTC

Arvind Neelakantan

@arvind_io

29 Jan 2022

in case people are counting, I forgot to share the results for text search from 3 more datasets (apart from the 11 text search results already reported) 🙂

Arvind Neelakantan

@arvind_io

28 Jan 2022

Replying to @arvind_io

Arvind Neelakantan · Jun 20, 2019 · 4:09 AM UTC

Arvind Neelakantan

@arvind_io

20 Jun 2019

Replying to @quocleix @ZihangDai @rsalakhu

Nice! Would be interesting to compare with vanilla Transformer trained using the new objective.

Arvind Neelakantan · Jan 25, 2022 · 7:22 PM UTC

Arvind Neelakantan

@arvind_io

25 Jan 2022

More details in the paper: cdn.openai.com/papers/Text_a…

Arvind Neelakantan · Nov 29, 2016 · 3:31 AM UTC

Arvind Neelakantan

@arvind_io

29 Nov 2016

We get good results on real-world question answering with neural semantic parsing/program induction. Code is here: github.com/tensorflow/models…

Stat.ML Papers @StatMLPapers

29 Nov 2016

Learning a Natural Language Interface with Neural Programmer. (arXiv:1611.08945v1 [cs.CL]) ift.tt/2fvmppE

Arvind Neelakantan · May 18, 2023 · 1:45 AM UTC

Arvind Neelakantan

@arvind_io

18 May 2023

Replying to @sdand

Any feedback for us ? :)

663

Arvind Neelakantan · Nov 1, 2019 · 2:52 AM UTC

Arvind Neelakantan

@arvind_io

1 Nov 2019

In our experiments we find that: 1) our model was able to incorporate external knowledge and generate factual text responses with a weak supervision signal. 2) our model can incorporate medium-size knowledge bases with only 8K training examples over multiple verticals.

Arvind Neelakantan · Feb 11, 2022 · 4:43 PM UTC

Arvind Neelakantan

@arvind_io

11 Feb 2022

Replying to @bobvanluijt @SeMI_tech @CShorten30 @OpenAI

This was fun, thanks for having me!

Arvind Neelakantan · Nov 4, 2019 · 4:04 AM UTC

Arvind Neelakantan

@arvind_io

4 Nov 2019

Replying to @arvind_io @GoogleAI @Google

Implementation of Neural Assistant: Joint Action Prediction, Response Generation, and Latent Knowledge Reasoning: github.com/tensorflow/tensor…

Arvind Neelakantan · Oct 12, 2018 · 3:05 AM UTC

Arvind Neelakantan

@arvind_io

12 Oct 2018

Replying to @GaryMarcus

Things are changing: arxiv.org/abs/1810.04805 and multiple other recent works in NLP

Arvind Neelakantan · Jan 26, 2022 · 5:02 PM UTC

Arvind Neelakantan

@arvind_io

26 Jan 2022

Replying to @jobergum

our method actually zero-shot transfers better than bm25 to 11 search tasks on average as shown in the entire table. even our smallest models are better than bm25. while it is not the only way to exploit training data with bm25, we perform better than one such method, docT5query

Arvind Neelakantan · Nov 14, 2020 · 9:14 PM UTC

Arvind Neelakantan

@arvind_io

14 Nov 2020

Replying to @OriolVinyalsML @Google @TensorFlow

I still remember your super helpful LSTM language model tutorial for 2015 interns! 🙂

Arvind Neelakantan · Feb 8, 2022 · 6:51 PM UTC

Arvind Neelakantan

@arvind_io

8 Feb 2022

The code for FIQA experiments to reproduce the results in the paper using the API: nitter.app/arvind_io/status/14882… . There's no discrepancy AFAIK. 2/4

Arvind Neelakantan

@arvind_io

31 Jan 2022

Arvind Neelakantan · Feb 2, 2020 · 7:01 PM UTC

Arvind Neelakantan

@arvind_io

2 Feb 2020

Replying to @egrefen @pfau

what are the drawbacks of the benchmark/metric and any suggestions on how they can be improved ?

Arvind Neelakantan · Nov 1, 2019 · 3:04 AM UTC

Arvind Neelakantan

@arvind_io

1 Nov 2019

Work done with awesome intern Semih Yavuz and many awesome colleagues @GoogleAI @Google

Arvind Neelakantan · Jul 12, 2025 · 8:36 PM UTC

Arvind Neelakantan

@arvind_io

12 Jul 2025

Replying to @agihippo

[GIF: Padme]

639

Arvind Neelakantan · Feb 8, 2022 · 6:51 PM UTC

Arvind Neelakantan

@arvind_io

8 Feb 2022

We leave out 6, not 7, BEIR datasets. Results on MSMARCO, NQ, TriviaQA are in a separate table (Table 5 in the paper). NQ is part of BEIR too and we didn't want to repeat it. The 6 datasets we leave out are not readily available and it is common to leave them out in prior work too. 3/4

Arvind Neelakantan · Dec 17, 2018 · 10:37 PM UTC

Arvind Neelakantan

@arvind_io

17 Dec 2018

Replying to @radcummings @AaronSchein

Congratulations!!!!

Arvind Neelakantan · Feb 8, 2022 · 6:51 PM UTC

Arvind Neelakantan

@arvind_io

8 Feb 2022

For example, SPLADE v2 (arxiv.org/pdf/2109.10086.pdf) also evaluates on the same 12 BEIR datasets. Discussion from their paper: 4/4

Arvind Neelakantan · Feb 21, 2019 · 4:49 AM UTC

Arvind Neelakantan

@arvind_io

21 Feb 2019

Replying to @quocleix

Agree! But I think the once widely used Brown clusters (e.g., wing.comp.nus.edu.sg/~antho/…) should also be given credit. They use a language model pre-training objective on unlabeled data and transfer the word clusters to supervised tasks. They are not "contextual" though.

Arvind Neelakantan · Sep 13, 2019 · 4:22 PM UTC

Arvind Neelakantan

@arvind_io

13 Sep 2019

Data: ai.google/tools/datasets/tas… Work done with many awesome colleagues at Google Assistant team and @GoogleAI along with student researcher Chinnadhurai Shankar

Arvind Neelakantan · Oct 13, 2018 · 9:20 PM UTC

Arvind Neelakantan

@arvind_io

13 Oct 2018

Replying to @earnmyturns @yoavgo

Also, the inductive bias of the Transformer makes it easier to skip words and learn long-range dependencies compared to RNNs. This paper arxiv.org/abs/1801.10198 has some supporting experiments

Generating Wikipedia by Summarizing Long Sequences

We show that generating English Wikipedia articles can be approached as a multi-document summarization of source documents. We use extractive summarization to coarsely identify salient...

arxiv.org

Arvind Neelakantan · May 29, 2018 · 3:25 PM UTC

Arvind Neelakantan

@arvind_io

29 May 2018

Thanks! As stated in the paper, we plan to release the code with the next version of the paper.

Arvind Neelakantan · Apr 12, 2025 · 5:01 PM UTC

Arvind Neelakantan

@arvind_io

12 Apr 2025

Replying to @melvinjohnsonp @Google @GoogleDeepMind

thank you, Melvin! look forward to working with you as well :)

477

Arvind Neelakantan · Feb 22, 2019 · 3:27 AM UTC

Arvind Neelakantan

@arvind_io

22 Feb 2019

Replying to @egrefen

I think it's a little harsh to call that work flag-planting. They performed experiments on 4 real-world datasets that AFAIK were widely used by the NLP community. In comparison there were many novel methods during that period only evaluated on toy-data.

Arvind Neelakantan · Jan 26, 2022 · 6:15 PM UTC

Arvind Neelakantan

@arvind_io

26 Jan 2022

Replying to @beirmug @Nthakur20 @CShorten30 @OpenAI

Thanks for building an extremely useful benchmark!

Arvind Neelakantan · Jan 25, 2022 · 7:19 PM UTC

Arvind Neelakantan

@arvind_io

25 Jan 2022

and also impressive performance on text classification and search!

Arvind Neelakantan · Jun 19, 2020 · 10:10 PM UTC

Arvind Neelakantan

@arvind_io

19 Jun 2020

Replying to @sama

I've noticed some of these similarities as well and @paulg explains it well "A startup founder is in effect an economic research scientist." (paulgraham.com/growth.html)

Arvind Neelakantan · Sep 19, 2019 · 11:44 PM UTC

Arvind Neelakantan

@arvind_io

19 Sep 2019

I think dropout: arxiv.org/abs/1207.0580 It got into JMLR after two years but people were already using it and building upon the arxiv version

Improving neural networks by preventing co-adaptation of feature detectors

When a large feedforward neural network is trained on a small training set, it typically performs poorly on held-out test data. This "overfitting" is greatly reduced by randomly omitting half of...

arxiv.org

Arvind Neelakantan · Jan 28, 2020 · 9:40 PM UTC

Arvind Neelakantan

@arvind_io

28 Jan 2020

Replying to @quocleix @xpearhead @lmthang

Awesome work! 🙂

Arvind Neelakantan · Oct 23, 2019 · 12:37 AM UTC

Arvind Neelakantan

@arvind_io

23 Oct 2019

Paper updated with experiments on image generation.

Arvind Neelakantan

@arvind_io

12 Jun 2019

Arvind Neelakantan · Mar 1, 2024 · 1:49 AM UTC

Arvind Neelakantan

@arvind_io

1 Mar 2024

Replying to @ZhuyunDai @OpenAI

11 beir datasets used in the embeddings v1 paper: arxiv.org/abs/2201.10005

Text and Code Embeddings by Contrastive Pre-Training

Text embeddings are useful features in many applications such as semantic search and computing text similarity. Previous work typically trains models customized for different use cases, varying in...

arxiv.org

228

Arvind Neelakantan · Jan 25, 2022 · 7:18 PM UTC

Arvind Neelakantan

@arvind_io

25 Jan 2022

we see massive improvement in code search using our models!

Arvind Neelakantan · Sep 19, 2018 · 9:01 PM UTC

Arvind Neelakantan

@arvind_io

19 Sep 2018

Congratulations!!!

Arvind Neelakantan · Oct 15, 2022 · 12:34 AM UTC

Arvind Neelakantan

@arvind_io

15 Oct 2022

Replying to @doomie @poolio

Noe Cafe!

Arvind Neelakantan · Dec 25, 2021 · 6:27 PM UTC

Arvind Neelakantan

@arvind_io

25 Dec 2021

Replying to @AndrewMayne @rushbhatia

Awesome, congratulations!!!

Arvind Neelakantan · Feb 2, 2022 · 5:05 AM UTC

Arvind Neelakantan

@arvind_io

2 Feb 2022

Replying to @Thiagogm @OpenAI

Arvind Neelakantan

@arvind_io

31 Jan 2022

Arvind Neelakantan · Apr 12, 2025 · 4:56 PM UTC

Arvind Neelakantan

@arvind_io

12 Apr 2025

Replying to @JeffDean @Google @GoogleDeepMind

thank you, Jeff! so happy to be back :)

968

Arvind Neelakantan · Jul 21, 2020 · 10:57 PM UTC

Arvind Neelakantan

@arvind_io

21 Jul 2020

congratulations!!! :)

Arvind Neelakantan · Nov 9, 2019 · 12:34 PM UTC

Arvind Neelakantan

@arvind_io

9 Nov 2019

Replying to @shaneguML

Congratulations!!!

Arvind Neelakantan · Jan 27, 2022 · 8:53 PM UTC

Arvind Neelakantan

@arvind_io

27 Jan 2022

Replying to @AkhileshGotmare

ndcg@10 as done in previous work
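
As a reference for the metric named above, here is a minimal sketch of nDCG@10. It assumes the linear-gain form of DCG (gain equal to the graded relevance); some implementations use an exponential gain instead, so this is not necessarily the exact variant used in the paper's evaluation. The function names and inputs are hypothetical.

```python
import math


def dcg_at_k(relevances, k=10):
    """Discounted cumulative gain over the first k graded relevance scores.

    Assumes linear gain: each score is divided by log2(rank + 1), rank from 1.
    """
    return sum(rel / math.log2(position + 2)
               for position, rel in enumerate(relevances[:k]))


def ndcg_at_k(ranked_relevances, judged_relevances, k=10):
    """nDCG@k: DCG of the system ranking divided by the DCG of the ideal ranking.

    ranked_relevances: relevance of each retrieved document, in ranked order.
    judged_relevances: relevance of every judged document for the query,
                       in any order (used to build the ideal ranking).
    """
    ideal_dcg = dcg_at_k(sorted(judged_relevances, reverse=True), k)
    if ideal_dcg == 0:
        return 0.0  # no relevant documents were judged for this query
    return dcg_at_k(ranked_relevances, k) / ideal_dcg


# One relevant document, retrieved at rank 2 instead of rank 1.
print(ndcg_at_k([0, 1], [1, 0]))
```

A benchmark score is then typically the mean of this value over all queries in the dataset.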

Arvind Neelakantan · Nov 2, 2019 · 3:48 AM UTC

Arvind Neelakantan

@arvind_io

2 Nov 2019

Replying to @arvind_io @julianharris

The conversation is annotated with accept/reject. At test time we would want the third-party business to implement a boolean function that returns whether the transaction can be completed. Neural Assistant will learn to work with the response as it has been annotated at training time.

Arvind Neelakantan · Jan 27, 2022 · 9:11 AM UTC

Arvind Neelakantan

@arvind_io

27 Jan 2022

Replying to @NirantK @rishabh16_

You can find the full table of results below. Even the smallest model outperforms bm-25 and its extension, docT5query

Arvind Neelakantan

@arvind_io

26 Jan 2022

Replying to @jobergum

Arvind Neelakantan · Jun 12, 2019 · 4:26 AM UTC

Arvind Neelakantan

@arvind_io

12 Jun 2019

Joint work with Daniel Duckworth, Ben Goodrich, @lukaszkaiser and Samy Bengio

Arvind Neelakantan · May 22, 2019 · 11:17 PM UTC

Arvind Neelakantan

@arvind_io

22 May 2019

congratulations!!!

Arvind Neelakantan · Oct 14, 2018 · 10:36 PM UTC

Arvind Neelakantan

@arvind_io

14 Oct 2018

Replying to @yoavgo @kentonctlee @earnmyturns @haldaume3 @andrewmccallum @ylecun

Improvement in decoding speed, as shown by some recent work in non-autoregressive machine translation

Arvind Neelakantan · Apr 12, 2025 · 4:58 PM UTC

Arvind Neelakantan

@arvind_io

12 Apr 2025

Replying to @quocleix @Google @GoogleDeepMind

thank you, Quoc! it was a great chat, felt like I never left :)

703

Arvind Neelakantan · Nov 2, 2019 · 3:53 AM UTC

Arvind Neelakantan

@arvind_io

2 Nov 2019

Replying to @arvind_io @julianharris

hope it answers your question!

Arvind Neelakantan · Apr 12, 2025 · 4:55 PM UTC

Arvind Neelakantan

@arvind_io

12 Apr 2025

Replying to @YiTayML @Google @GoogleDeepMind

thank you!

675

Arvind Neelakantan · Nov 1, 2019 · 2:54 AM UTC

Arvind Neelakantan

@arvind_io

1 Nov 2019

The model is trained at turn level, where the dialog history fed into the model as input contains the previous ground-truth turns of the dialog. In the conversations here, the actual text responses generated by the model itself are used as the assistant’s side of the dialog history fed as input.

Arvind Neelakantan · Dec 12, 2020 · 1:46 AM UTC

Arvind Neelakantan

@arvind_io

12 Dec 2020

Replying to @AndrewMayne @AmazonPub @Trident_Media

Congratulations!!!

Arvind Neelakantan · Apr 29, 2020 · 4:15 PM UTC

Arvind Neelakantan

@arvind_io

29 Apr 2020

Replying to @jaseweston

Really nice work, congratulations!!!

Arvind Neelakantan · May 31, 2019 · 9:28 PM UTC

Arvind Neelakantan

@arvind_io

31 May 2019

Replying to @dmimno

congratulations!!!

Arvind Neelakantan · Jul 20, 2021 · 4:35 AM UTC

Arvind Neelakantan

@arvind_io

20 Jul 2021

Replying to @AndrewMayne @amazonbooks @AmazonPub

Nice!

Arvind Neelakantan · Nov 2, 2019 · 3:43 AM UTC

Arvind Neelakantan

@arvind_io

2 Nov 2019

Replying to @HandNF @JeffDean

Thanks for the interest. I think Neural Assistant + Taskmaster (ai.google/tools/datasets/tas…) + Google search results as source for external knowledge can work really well for task-oriented dialog!

Arvind Neelakantan · Nov 12, 2019 · 9:49 PM UTC

Arvind Neelakantan

@arvind_io

12 Nov 2019

Congratulations!!!

Arvind Neelakantan · Dec 14, 2019 · 12:20 AM UTC

Arvind Neelakantan

@arvind_io

14 Dec 2019

Replying to @DBahdanau

Nice work! 🙂