Lihong Li · Aug 24, 2024 · 3:30 PM UTC

Lihong Li

Pinned Tweet

Lihong Li

@LihongLi20

24 Aug 2024

Amazon Rufus is an expert shopping assistant powered by GenAI. We’re hiring LLM/RL talents to work on an array of intellectually challenging science questions. Come join us in this exciting (and fun) adventure!

131

71,137

Lihong Li · Sep 26, 2020 · 11:19 PM UTC

Lihong Li

@LihongLi20

26 Sep 2020

Next week, @marcgbellemare and I are organizing a Deep RL workshop as part of Simons Institute's Theoretical RL program, with a great lineup of speakers. All talks will be recorded, and can be viewed live on YouTube channel. See simons.berkeley.edu/workshop… for more details!

Deep Reinforcement Learning

Moderators: Pablo Castro (Google), Joel Lehman (Uber), and Dale Schuurmans (University of Alberta) The success of deep neural networks in modeling complicated functions has recently been applied by...

simons.berkeley.edu

182

Lihong Li · Sep 2, 2020 · 3:03 AM UTC

Lihong Li

@LihongLi20

2 Sep 2020

Interested in reinforcement learning *without* interaction with the environment or simulator? We're organizing a @NeurIPSConf 2020 workshop on Offline RL. Visit the homepage offline-rl-neurips.github.io for more details including Call for Papers!

Rishabh Agarwal

@agarwl_

1 Sep 2020

Excited to be organizing a workshop on Offline Reinforcement Learning @NeurIPSConf 2020! CfP and other details at offline-rl-neurips.github.io. With organizers Aviral Kumar @berkeley_ai, @georgejtucker, Doina Precup @DeepMind and @LihongLi20.

151

Lihong Li · May 2, 2023 · 5:07 PM UTC

Lihong Li

@LihongLi20

2 May 2023

Truly grateful and humbled to receive the award. It's gratifying to see this 13-year old work continues to be useful, and exciting to witness how much the field has grown since then! Congrats to my coauthors, Wei, @JohnCLangford and Rob.

The Web Conference @TheWebConf

2 May 2023

#TheWebConf2023 Seoul Test of Time Award: "A Contextual-Bandit Approach to Personalized News Article Recommendation" Lihong Li (Amazon), Wei Chu (Ant Group), John Langford (Microsoft) and Robert Schapire (Microsoft) First presented at the 2010 conference. Congrats!

22,626

Lihong Li · Jan 19, 2022 · 8:09 PM UTC

Lihong Li

@LihongLi20

19 Jan 2022

Excited to share how reinforcement learning is used to delight customers in Amazon, among others!

Amazon Science

@AmazonScience

19 Jan 2022

How does the Amazon Store know what products and offers to display? Part of the answer involves reinforcement learning. Learn how scientists in @AmazonAds are developing reinforcement learning techniques to improve outcomes for customers. #machinelearning amazon.science/working-at-am…

Lihong Li · Oct 11, 2019 · 5:32 PM UTC

Lihong Li

@LihongLi20

11 Oct 2019

I'm excited to share the CfP for the Machine Learning Special Issue on RL for Real Life: springer.com/journal/10994/u… (With Alborz Geramifard, @yuxili99, @CsabaSzepesvari, Tao Wang). Deadline: March 5, 2020.

Lihong Li · Jun 25, 2020 · 2:35 AM UTC

Lihong Li

@LihongLi20

25 Jun 2020

Please come join us this weekend if you're interested in how RL is applied to the real life!

Yuxi Li @yuxili99

25 Jun 2020

It is exciting our RL for Real Life 2020 Virtual Conference is approaching on June 27-28, sites.google.com/view/RL4Rea…, co-organized with @gabepsilon, Alborz Geramifard, Omer Gottesman, @LihongLi20, Anusha Nagabandi, @TonyZQin, @CsabaSzepesvari.

Lihong Li · Sep 28, 2020 · 8:13 PM UTC

Lihong Li

@LihongLi20

28 Sep 2020

Awesome Day 1 of the Deep RL workshop. Enjoyed the excellent talks by @tengyuma @EmmaBrunskill @svlevine @ofirnachum . Thanks every one for participating. Looking forward to Day 2! @SimonsInstitute

Marc G. Bellemare @marcgbellemare

28 Sep 2020

Excited to kick off the Deep Reinforcement Learning theory workshop at the Simons Institute today, co-organized with @LihongLi20 . Today's topic is Offline reinforcement learning 🔥 Schedule is here: simons.berkeley.edu/workshop…

Lihong Li · Sep 29, 2020 · 7:11 PM UTC

Lihong Li

@LihongLi20

29 Sep 2020

Another fantastic day at the Deep RL workshop! Thx to @IanOsband @chelseabfinn @wwdabney Alekh Agarwal for the wonderful talks, and inspiring discussions moderated by Joel Lehman. All sessions recorded. Looking forward to tomorrow (optimization!) @marcgbellemare @SimonsInstitute

Lihong Li · Dec 5, 2019 · 7:56 AM UTC

Lihong Li

@LihongLi20

5 Dec 2019

As co-organizer, I'm super excited about the program and looking forward to it next week. Come join us at #NeurIPS2019 if you're curious about how the optimization toolkit helps to design, unify and analyze RL algorithms!

Bo Dai @daibond_alpha

5 Dec 2019

The schedule and accepted papers are released: optrl2019.github.io/. Congratulations to all the recipients of the travel awards. We thank all the invited speakers, panelists and authors. Thanks to our sponsors @GoogleAI and @DeepMindAI. See you in Vancouver next week.

Lihong Li · May 19, 2021 · 12:24 AM UTC

Lihong Li

@LihongLi20

19 May 2021

I'm looking for an Applied Scientist with strong ML/Stats background to join our team in Amazon Advertising. The position is based in New York City: amazon.jobs/en/jobs/1544000/…. Please consider applying!

Lihong Li · Dec 11, 2020 · 5:12 PM UTC

Lihong Li

@LihongLi20

11 Dec 2020

One more day before the Offline Reinforcement Learning Workshop at @NeurIPSConf. Consider submitting questions to the panelists at offline-rl-neurips.github.io… . See you tomorrow! #OFFLINERL2020

Lihong Li · Oct 1, 2020 · 7:57 PM UTC

Lihong Li

@LihongLi20

1 Oct 2020

Thanks to @jacobandreas @clarelyle @yayitsamyzhang Doina Precup & @ShamKakade6 for the wonderful talks at the deep RL workshop, and to the audience, esp. given how close the @iclr_conf deadline is. Come join us tomorrow to recover from the deadline craziness! 😀 @marcgbellemare

Lihong Li · Dec 13, 2020 · 10:44 PM UTC

Lihong Li

@LihongLi20

13 Dec 2020

Look forward to talking at the AI for Economics seminar [aiforeconomics.com] on 12/15. There is a natural connection b/t off-policy #ReinforcementLearning & econometrics. Thx to the organizers (David Parkes, @alexrtrott, @StephanZheng) for inviting!

Lihong Li · Apr 6, 2021 · 5:20 PM UTC

Lihong Li

@LihongLi20

6 Apr 2021

We are opening an exciting Early-career Scientist program at Amazon Advertising, to attract talent to innovate on behalf of our customers and publish their cutting-edge research. Please consider applying and share broadly. Application deadline: May 14. amazon.science/amazon-advert…

Amazon Advertising opens applications for early career scientists

The new program, which offers full-time two-year positions, is aimed at recent PhD graduates who want to innovate, publish, and have their work impact millions of customers. The application deadline...

amazon.science

Lihong Li · Jul 8, 2020 · 5:00 PM UTC

Lihong Li

@LihongLi20

8 Jul 2020

A systematic study of long-horizon off-policy evaluation via duality! Related to an earlier doubly robust work: openreview.net/forum?id=S1gl… , but in the more general behavior-agnostic setting, and with a more careful investigation of various algorithmic choices in the design space.

Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation

We develop a new doubly robust estimator based on the infinite horizon density ratio and off policy value estimation.

openreview.net

Ofir Nachum @ofirnachum

8 Jul 2020

Policy evaluation via duality/Lagrangian methods presents a lot of choices (how to setup the LPs, regularize them, etc). In arxiv.org/abs/2007.03438 we examine how these choices affect accuracy of final eval. Lots of insights in this paper, many of which I didn't expect....

Lihong Li · Sep 30, 2020 · 7:24 PM UTC

Lihong Li

@LihongLi20

30 Sep 2020

Learned a great deal today about how to do better optimization in deep RL from excellent talks by Matthieu Geist, Nevena Lazic, @pabbeel & Martha White. Esp enjoyed the discussions (thx @neu_rips for "spicy" questions). Can't wait for tomorrow! @marcgbellemare @SimonsInstitute

Lihong Li · Dec 13, 2023 · 5:57 AM UTC

Lihong Li

@LihongLi20

13 Dec 2023

Replying to @denny_zhou @ysu_nlp

And you don't need *train*ing :)

806

Lihong Li · Dec 14, 2019 · 6:04 AM UTC

Lihong Li

@LihongLi20

14 Dec 2019

Congrats to the authors! Looking forward to the workshop tomorrow.

This Post is from an account that no longer exists.

Lihong Li · Mar 19, 2022 · 12:13 AM UTC

Lihong Li

@LihongLi20

19 Mar 2022

My team is hiring a Data Scientist to extract critical insights from data and influence customer-facing shopping experiences in Amazon Ads: amazon.jobs/en/jobs/1990034/…. Please reach out if you are interested! #Amazon #advertising #DataScience #dataScientist #Statistics

Lihong Li · Jun 13, 2019 · 8:55 PM UTC

Lihong Li

@LihongLi20

13 Jun 2019

Vote for questions you find interesting for the panel discussion: tricider.com/brainstorming/2… , as part of the RL for Real Life workshop sites.google.com/view/RL4Rea… . Please do so by 10:30am June 14 PDT!

Lihong Li · Feb 26, 2021 · 4:31 PM UTC

Lihong Li

@LihongLi20

26 Feb 2021

Replying to @lilianweng

Interesting idea. On the other hand, a paper should be self-contained, so just describing the differences from another paper probably won't work in most cases.

Lihong Li · Aug 24, 2024 · 3:31 PM UTC

Lihong Li

@LihongLi20

24 Aug 2024

Rufus leverages Amazon’s vast product knowledge, customer reviews, community Q&As, and more, to inspire and answer questions throughout your shopping journey. More about Rufus: aboutamazon.com/news/retail/…

Amazon announces Rufus, a new generative AI-powered conversational shopping experience

With Rufus, customers are now able to shop alongside a generative AI-powered expert that knows Amazon’s selection inside and out, and can bring it all together with information from across the web to...

aboutamazon.com

2,898

Lihong Li · Aug 24, 2024 · 10:59 PM UTC

Lihong Li

@LihongLi20

24 Aug 2024

Here are a few openings, with more on the way! amazon.jobs/en/jobs/2728692/… amazon.jobs/en/jobs/2728534/… amazon.jobs/en/jobs/2728712/…

2,370

Lihong Li · Nov 12, 2019 · 4:17 AM UTC

Lihong Li

@LihongLi20

12 Nov 2019

Replying to @nanjiang_cs

Will continue reading the details. In our recent paper [arxiv.org/abs/1910.07186] we were similarly surprised by the interplay between value functions and importance ratios, where we found a similar estimator (using V instead of Q). [1/2]

Lihong Li · Aug 24, 2024 · 3:38 PM UTC

Lihong Li

@LihongLi20

24 Aug 2024

Feel free to message me if interested!

3,009

Lihong Li · Jul 8, 2020 · 5:00 PM UTC

Lihong Li

@LihongLi20

8 Jul 2020

An intriguing and rather surprising finding is the superiority of working with the duals (state visitation distributions) over the primal (value functions), although the latter has been the "default" approach in much of RL literature.

Lihong Li · Nov 12, 2019 · 4:22 AM UTC

Lihong Li

@LihongLi20

12 Nov 2019

Replying to @LihongLi20 @nanjiang_cs

You may also find Sec 4 interesting, which makes explicit the connection to Lagrangian duality (as hinted at the end of your paper). Still reading/enjoying your paper. Many interesting stuffs at the intersection of optimization & RL! [2/2]

Lihong Li · Jun 2, 2023 · 7:05 AM UTC

Lihong Li

@LihongLi20

2 Jun 2023

Replying to @yisongyue @nanjiang_cs

To make things harder, most environments are non-stationary. We may model nonstationarity by hidden state variables (at least conceptually), but will need strong assumptions to handle it effectively (to my best knowledge).

289

Lihong Li · Jun 6, 2020 · 6:16 PM UTC

Lihong Li

@LihongLi20

6 Jun 2020

Replying to @mlittmancs @BrownHCRI

CONGRATS!

Lihong Li · Oct 11, 2019 · 5:22 PM UTC

Lihong Li

@LihongLi20

11 Oct 2019

Replying to @SoloGen

A perhaps trickier situation: what if you remember the claim, but forget how to prove it...

Lihong Li · Sep 26, 2020 · 11:06 PM UTC

Lihong Li

@LihongLi20

26 Sep 2020

Replying to @nanjiang_cs

Yes :) with @daibond_alpha, @ofirnachum, Yinlam Chow, @CsabaSzepesvari and Dale Schuurmans.

Lihong Li · Jul 2, 2020 · 1:48 AM UTC

Lihong Li

@LihongLi20

2 Jul 2020

Replying to @ShamKakade6 @arkrause

Congrats!!

Lihong Li · Mar 26, 2020 · 9:52 PM UTC

Lihong Li

@LihongLi20

26 Mar 2020

Replying to @nanjiang_cs

Same for me; took me 3 days to realize they are different...

Lihong Li · Sep 26, 2020 · 10:58 PM UTC

Lihong Li

@LihongLi20

26 Sep 2020

Replying to @nanjiang_cs

Congrats! The name change is helpful. Incidentally, we have a paper accepted to NeurIPS that is about "confidence" intervals. :-)

Lihong Li · Oct 10, 2019 · 6:07 AM UTC

Lihong Li

@LihongLi20

10 Oct 2019

Replying to @CsabaSzepesvari

Thanks, Csaba. Would be great to have you at the workshop!

Lihong Li · Oct 22, 2024 · 3:22 AM UTC

Lihong Li

@LihongLi20

22 Oct 2024

Replying to @mlittmancs @mitpress

Congrats! It's on my to-listen list :)

179

Lihong Li · Jul 8, 2020 · 8:13 PM UTC

Lihong Li

@LihongLi20

8 Jul 2020

Replying to @zicokolter @roderickmelrose @_vaishnavh

Looks cool!

Lihong Li · Jun 19, 2020 · 10:31 PM UTC

Lihong Li

@LihongLi20

19 Jun 2020

Replying to @edchi

Very sorry for your loss, Ed! Thanks for sharing these beautiful memories. Your father was truly amazing.

Lihong Li · Dec 3, 2020 · 2:48 AM UTC

Lihong Li

@LihongLi20

3 Dec 2020

Replying to @marcgbellemare

This is awesome! Huge congrats!

Lihong Li · May 30, 2024 · 11:07 PM UTC

Lihong Li

@LihongLi20

30 May 2024

Replying to @nanjiang_cs

BIG congrats!!!

403

Lihong Li · May 13, 2024 · 6:22 PM UTC

Lihong Li

@LihongLi20

13 May 2024

Replying to @yubai01 @OpenAI

Congrats!

460

Lihong Li · Sep 30, 2020 · 5:53 PM UTC

Lihong Li

@LihongLi20

30 Sep 2020

Replying to @neu_rips

Looking forward to it!

Lihong Li · Feb 16, 2020 · 7:45 AM UTC

Lihong Li

@LihongLi20

16 Feb 2020

Replying to @nanjiang_cs

It can be confusing, unless the math is shown... :-(

Lihong Li · Jan 23, 2023 · 9:17 PM UTC

Lihong Li

@LihongLi20

23 Jan 2023

Replying to @shaneguML @GoogleAI @OpenAI @johnschulman2

Congrats!

659

Lihong Li · Jun 13, 2020 · 1:58 AM UTC

Lihong Li

@LihongLi20

13 Jun 2020

Replying to @HoangMinhLe @yisongyue

BIG congrats!

Lihong Li · Dec 9, 2019 · 9:26 PM UTC

Lihong Li

@LihongLi20

9 Dec 2019

Congratulations!

Lihong Li · May 13, 2020 · 7:51 AM UTC

Lihong Li

@LihongLi20

13 May 2020

Replying to @hanzhao_ml @IllinoisCS

Congratulations!

Lihong Li · Jan 22, 2023 · 6:56 PM UTC

Lihong Li

@LihongLi20

22 Jan 2023

Replying to @edchi

Congrats @edchi ! Well deserved!

408

Lihong Li · Jul 15, 2020 · 11:14 PM UTC

Lihong Li

@LihongLi20

15 Jul 2020

Replying to @mlittmancs @aaas @BrownHCRI @BrownBigAI

Looking forward!

Lihong Li · Oct 31, 2024 · 6:44 AM UTC

Lihong Li

@LihongLi20

31 Oct 2024

Replying to @SebastienBubeck @OpenAI @sama

Congrats, Seb!!

323

Lihong Li · Nov 12, 2019 · 12:07 AM UTC

Lihong Li

@LihongLi20

12 Nov 2019

Replying to @nanjiang_cs

Super cool & interesting!

Lihong Li · Dec 12, 2020 · 12:05 AM UTC

Lihong Li

@LihongLi20

12 Dec 2020

Replying to @0xJChen @NeurIPSConf

My short answer is no: they are difficult in different ways as the settings are different. For a longer answer (or different opinion), submit your question and come to the panel tomorrow! 😀

Lihong Li · May 12, 2020 · 3:28 AM UTC

Lihong Li

@LihongLi20

12 May 2020

Replying to @SimonShaoleiDu

Congrats and welcome to Seattle!

Lihong Li · May 10, 2024 · 11:12 PM UTC

Lihong Li

@LihongLi20

10 May 2024

Replying to @ShunyuYao12 @karthik_r_n @princeton_nlp @PrincetonPLI

Congrats!

464

Lihong Li · Jun 2, 2020 · 6:36 AM UTC

Lihong Li

@LihongLi20

2 Jun 2020

Replying to @nanjiang_cs

Impressive that you got signals out of my very random noise (question)! ;)

Lihong Li · Jul 17, 2020 · 4:31 AM UTC

Lihong Li

@LihongLi20

17 Jul 2020

Replying to @jiajunwu_cs

Congrats!!

Lihong Li · Oct 10, 2020 · 8:20 PM UTC

Lihong Li

@LihongLi20

10 Oct 2020

Replying to @neu_rips

for what...?

Lihong Li · Feb 20, 2024 · 8:24 PM UTC

Lihong Li

@LihongLi20

20 Feb 2024

Replying to @SimonShaoleiDu @SloanFoundation

Congrats! Very well-deserved!

157

Lihong Li · May 13, 2020 · 7:39 AM UTC

Lihong Li

@LihongLi20

13 May 2020

Replying to @yisongyue @Caltech

Congratulations!!

Lihong Li · Sep 6, 2019 · 6:44 AM UTC

Lihong Li

@LihongLi20

6 Sep 2019

Replying to @mtoneva1 @chrodan

Congratulations!!

Lihong Li · May 2, 2024 · 1:19 AM UTC

Lihong Li

@LihongLi20

2 May 2024

Replying to @kchonyc @nanjiang_cs

should still work: microsoft.com/en-us/research… :)

Multiworld Testing - Microsoft Research

Exponentially better than A/B testing. Multiworld Testing (MWT) is the capability to test and optimize over K policies (context-based decision rules) using an amount of data and computation that...

microsoft.com

184

Lihong Li · Feb 20, 2024 · 8:25 PM UTC

Lihong Li

@LihongLi20

20 Feb 2024

Replying to @nanjiang_cs

Congrats! Very well-deserved!

257

Lihong Li · Apr 21, 2020 · 2:05 PM UTC

Lihong Li

@LihongLi20

21 Apr 2020

Replying to @mlittmancs @BrownHCRI @BrownBigAI

Congratulations!

Lihong Li · Jun 10, 2024 · 2:36 PM UTC

Lihong Li

@LihongLi20

10 Jun 2024

Replying to @zicokolter

Congrats, Zico!

232

Lihong Li · Dec 15, 2023 · 11:51 PM UTC

Lihong Li

@LihongLi20

15 Dec 2023

Replying to @StanfordDBDS

Congrats @james_y_zou! Very well-deserved!

398

Lihong Li · Jun 1, 2020 · 5:37 AM UTC

Lihong Li

@LihongLi20

1 Jun 2020

Replying to @nanjiang_cs

Congrats!

Lihong Li · Oct 19, 2019 · 11:01 PM UTC

Lihong Li

@LihongLi20

19 Oct 2019

Replying to @LihongLi20 @pierrelux @yaoliucs @EmmaBrunskill

2/3 One quick comment: As the paper already points out, the Stationary IS studied here is essentially the marginalized IS of Xie et al., not the distributions studied in HM/LLTZ/GB. You seem to suggest the difference can be removed by "taking T → ∞ when necessary" (page 3) ...

Lihong Li · Oct 19, 2019 · 11:02 PM UTC

Lihong Li

@LihongLi20

19 Oct 2019

Replying to @LihongLi20 @pierrelux @yaoliucs @EmmaBrunskill

3/3 ... but it seems tricky. Eg, as T → ∞, IS/PDIS can have infinite variance (see LLTZ for an example), but probably not for the methods in HM/LLTZ/GB.

Lihong Li · Feb 21, 2024 · 1:49 AM UTC

Lihong Li

@LihongLi20

21 Feb 2024

Replying to @chijinML

Congrats! So well-deserved!

193

Lihong Li · Oct 24, 2019 · 4:40 PM UTC

Lihong Li

@LihongLi20

24 Oct 2019

Replying to @tengyuma

Interesting! Esp. the nice & simple example that illustrates the exponential gap between representing a model and a value function. Reminds me of earlier work that shows a similar exponential gap, eg in the context of factored MDPs (Boutilier et al.: doi.org/10.1613/jair.575).