Zhiqing Sun · Apr 8, 2026 · 4:47 PM UTC

Zhiqing Sun

Pinned Tweet

Zhiqing Sun

@EdwardSun0909

Apr 8

Excited to share Muse Spark, the first model from whole team’s work in MSL! 🚀 It’s natively multimodal and agentic. I’ve been using it for my daily coding and research tasks. Still plenty of room to improve in agentic domains, but we’re moving with great velocity. It’s a seriously good model! Check out the full breakdown and try it out in meta.ai

Meta AI

Use Meta AI assistant to get things done, create AI-generated images for free, and get answers to any of your questions.

meta.ai

Alexandr Wang

@alexandr_wang

Apr 8

1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵

205

22,417

Zhiqing Sun · Feb 28, 2025 · 7:41 PM UTC

Zhiqing Sun

@EdwardSun0909

28 Feb 2025

I successfully defended my PhD thesis today! 🎉 "Scalable Alignment of Large Language Models Towards Truth-Seeking, Complex Reasoning, and Human Values" Slides (Fact-RLHF, Lean-STaR, Easy-to-Hard Generalization, Self-Align, Instructable Reward Model): docs.google.com/presentation… A huge thank you to my thesis committee and all attendees for their valuable feedback and support! ❤️ @wellecks @lileics @denny_zhou & Yiming

1,237

98,483

Zhiqing Sun · Feb 3, 2025 · 12:55 AM UTC

Zhiqing Sun

@EdwardSun0909

3 Feb 2025

Excited to finally share what I’ve been working on since joining OpenAI last June! The goal of deep-research is to enable reasoning models with tools to tackle long-horizon tasks in the real world and discover new knowledge. It’s a highly autonomous agent—hand it a hard problem, grab a coffee, and come back to a well-researched solution in 10–30 minutes. Trained end-to-end with reinforcement learning in a tool-enabled environment, deep-research is built to seek truth and understand the universe. A key milestone is its performance on humanity’s "last exam," demonstrating the true power of an end-to-end trained agent. 2025 is the year of agents. Looking forward to what’s ahead! openai.com/index/introducing…

Introducing deep research

An agent that uses reasoning to synthesize large amounts of online information and complete multi-step research tasks for you. Available to Pro users today, Plus and Team next.

openai.com

980

165,390

Zhiqing Sun · Apr 10, 2025 · 6:20 PM UTC

Zhiqing Sun

@EdwardSun0909

10 Apr 2025

We’re releasing BrowseComp, which stands for Browsing Competition. 🏎️ Think of it like coding or math competitions — while these contests may not perfectly reflect real-world SWE or mathematical research, they do capture a spark of intelligence. This is THE benchmark we should care about when evaluating the intelligence of deep research-like browsing agents.

OpenAI

@OpenAI

10 Apr 2025

We’re open-sourcing BrowseComp (“Browsing Competition”), a new, challenging benchmark designed to test how well AI agents can browse the internet to find hard-to-locate information. It’s like an online scavenger hunt…but for browsing agents. openai.com/index/browsecomp/

913

473,431

Zhiqing Sun · Jan 27, 2025 · 1:19 PM UTC

Zhiqing Sun

@EdwardSun0909

27 Jan 2025

Bad take (opinions are my own)

Steven Heidel

@stevenheidel

27 Jan 2025

americans sure love giving their data away to the CCP in exchange for free stuff

Community note

DeepSeek can be run locally without an internet connection, unlike OpenAI's models. github.com/deepseek-a

826

134,408

Zhiqing Sun · Aug 14, 2025 · 8:46 PM UTC

Zhiqing Sun

@EdwardSun0909

14 Aug 2025

Excited to share that I recently joined the MSL team! Building personal superintelligence is serious and fun here. Join us!

Hyung Won Chung

@hwchung27

14 Aug 2025

After a great time at OpenAI, we (@EdwardSun0909, @_jasonwei) recently joined @Meta Superintelligence Labs. The first month has already been so much fun building from a clean slate with a truly talent-dense team! Very excited about the compute and long term focus of the new lab

846

269,524

Zhiqing Sun · Dec 20, 2024 · 6:49 PM UTC

Zhiqing Sun

@EdwardSun0909

20 Dec 2024

honored to have contributed to o3😎

ARC Prize

@arcprize

20 Dec 2024

New verified ARC-AGI-Pub SoTA! @OpenAI o3 has scored a breakthrough 75.7% on the ARC-AGI Semi-Private Evaluation. And a high-compute o3 configuration (not eligible for ARC-AGI-Pub) scored 87.5% on the Semi-Private Eval. 1/4

772

101,081

Zhiqing Sun · Jul 16, 2025 · 11:46 PM UTC

Zhiqing Sun

@EdwardSun0909

16 Jul 2025

You can just do things 🖱️

OpenAI

@OpenAI

16 Jul 2025

674

90,322

Zhiqing Sun · Jan 1, 2025 · 12:56 AM UTC

Zhiqing Sun

@EdwardSun0909

1 Jan 2025

Challenge accepted — 2025 will be our best year yet!

Sam Altman

@sama

30 Dec 2024

common themes: AGI agents much better 4o upgrade much better memory longer context “grown up mode” deep research feature better sora more personalization (interestingly, many great updates we have coming were mentioned not at all or very little!)

648

135,195

Zhiqing Sun · Feb 25, 2025 · 6:48 PM UTC

Zhiqing Sun

@EdwardSun0909

25 Feb 2025

We’re rolling out Deep Research to Plus users today! Deep Research was the biggest “Feel The AGI” moment I’ve ever had since ChatGPT. And I’m glad more people will experience their first AGI moment! The team also worked super hard to make more tools including image citations / python / user files etc available to the model in this launch!

OpenAI

@OpenAI

25 Feb 2025

Replying to @OpenAI

We're also sharing the system card, detailing how we built deep research, assessed its capabilities and risks, and improved safety. openai.com/index/deep-resear…

480

49,265

Zhiqing Sun · Jul 18, 2025 · 8:23 PM UTC

Zhiqing Sun

@EdwardSun0909

18 Jul 2025

just tried and the agent solved level 1 in its own browser lol. thanks for creating the benchmark!

ARC Prize

@arcprize

18 Jul 2025

Replying to @arcprize

o3 (left) and Grok 4 (right) replays below spoiler: neither complete a single level

458

106,451

Zhiqing Sun · Jul 17, 2025 · 6:29 PM UTC

Zhiqing Sun

@EdwardSun0909

17 Jul 2025

Excited to share the agent with the world! It’s a good agent!

OpenAI

@OpenAI

17 Jul 2025

ChatGPT can now do work for you using its own computer. Introducing ChatGPT agent—a unified agentic system combining Operator’s action-taking remote browser, deep research’s web synthesis, and ChatGPT’s conversational strengths.

425

78,834

Zhiqing Sun · Jul 19, 2025 · 8:44 AM UTC

Zhiqing Sun

@EdwardSun0909

19 Jul 2025

I heard reinforcement learning only works with verifiable rewards? 😛 Congrats!!

Alexander Wei

@alexwei_

19 Jul 2025

1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).

407

33,608

Zhiqing Sun · Sep 28, 2025 · 3:11 AM UTC

Zhiqing Sun

@EdwardSun0909

28 Sep 2025

I don’t often tweet on technical topics but I may have an opposite opinion here…

This tweet is unavailable

376

90,344

Zhiqing Sun · Feb 22, 2023 · 8:37 PM UTC

Zhiqing Sun

@EdwardSun0909

22 Feb 2023

How can LLMs such as GPT-3 and ChatGPT achieve greater factual accuracy without relying on an external retrieval search engine? Our #ICLR2023 paper shows that recitation can help - like humans! Recitation-Augmented Language Models arxiv.org/abs/2210.01296 1/N

364

59,356

Zhiqing Sun · Apr 11, 2024 · 5:23 PM UTC

Zhiqing Sun

@EdwardSun0909

11 Apr 2024

Our research on easy-to-hard generalization will be supported by the OpenAI Superalignment Fast Grant. Congratulations to the team and stay tuned!😎

Zhiqing Sun

@EdwardSun0909

21 Mar 2024

🌟Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision 🌟 arxiv.org/abs/2403.09472 How can we keep improving AI systems when their capabilities surpass those of human supervisors? (1/n)

342

53,780

Zhiqing Sun · Apr 10, 2025 · 5:27 PM UTC

Zhiqing Sun

@EdwardSun0909

10 Apr 2025

Memory is the next scaling laws paradigm shift

OpenAI

@OpenAI

10 Apr 2025

Starting today, memory in ChatGPT can now reference all of your past chats to provide more personalized responses, drawing on your preferences and interests to make it even more helpful for writing, getting advice, learning, and beyond.

317

26,840

Zhiqing Sun · Oct 11, 2023 · 5:26 PM UTC

Zhiqing Sun

@EdwardSun0909

11 Oct 2023

🚀 Can RLAIF fully replace RLHF to align language models from scratch, enhancing both their alignment and capabilities? SALMON introduces a principle-following reward model in the realm of self-alignment, using just 6 ICL exemplars and 31 principles to outperform LLaMA-2-Chat!

295

100,070

Zhiqing Sun · Nov 7, 2025 · 1:17 AM UTC

Zhiqing Sun

@EdwardSun0909

7 Nov 2025

the real agi competition is between vllm and sglang

273

32,998

Zhiqing Sun · Mar 21, 2024 · 2:41 PM UTC

Zhiqing Sun

@EdwardSun0909

21 Mar 2024

256

106,753

Zhiqing Sun · Feb 6, 2025 · 7:25 PM UTC

Zhiqing Sun

@EdwardSun0909

6 Feb 2025

All I see is @GaryMarcus saying “Deep Research is genuinely useful” 🙂

Gary Marcus

@GaryMarcus

6 Feb 2025

Deep Research is genuinely useful - depending on your application - but crucially (as anticipated by Rebooting AI in 2019, and by @yudapearl) facts and temporal reasoning remain problematic for current neural network-based approaches that lean heavily on statistics rather than deep understanding.

211

67,798

Zhiqing Sun · Aug 9, 2025 · 9:08 PM UTC

Zhiqing Sun

@EdwardSun0909

9 Aug 2025

gpt-5-reasoning is a good model 🫡

Artificial Analysis

@ArtificialAnlys

7 Aug 2025

OpenAI gave us early access to GPT-5: our independent benchmarks verify a new high for AI intelligence. We have tested all four GPT-5 reasoning effort levels, revealing 23x differences in token usage and cost between the ‘high’ and ‘minimal’ options and substantial differences in intelligence We have run our full suite of eight evaluations independently across all reasoning effort configurations of GPT-5 and are reporting benchmark results for intelligence, token usage, and end-to-end latency. What @OpenAI released: OpenAI has released a single endpoint for GPT-5, but different reasoning efforts offer vastly different intelligence. GPT-5 with reasoning effort “High” reaches a new intelligence frontier, while “Minimal” is near GPT-4.1 level (but more token efficient). Takeaways from our independent benchmarks: ⚙️ Reasoning effort configuration: GPT-5 offers four reasoning effort configurations: high, medium, low, and minimal. Reasoning effort options steer the model to “think” more or less hard for each query, driving large differences in intelligence, token usage, speed, and cost. 🧠 Intelligence achieved ranges from frontier to GPT-4.1 level: GPT-5 sets a new standard with a score of 68 on our Artificial Analysis Intelligence Index (MMLU-Pro, GPQA Diamond, Humanity’s Last Exam, LiveCodeBench, SciCode, AIME, IFBench & AA-LCR) at High reasoning effort. Medium (67) is close to o3, Low (64) sits between DeepSeek R1 and o3, and Minimal (44) is close to GPT-4.1. While High sets a new standard, the increase over o3 is not comparable to the jump from GPT-3 to GPT-4 or GPT-4o to o1. 💬 Token usage varies 23x between reasoning efforts: GPT-5 with High reasoning effort used more tokens than o3 (82M vs. 50M) to complete our Index, but still fewer than Gemini 2.5 Pro (98M) and DeepSeek R1 0528 (99M). However, Minimal reasoning effort used only 3.5M tokens which is substantially less than GPT-4.1, making GPT-5 Minimal significantly more token-efficient for similar intelligence. 📖 Long Context Reasoning: We released our own Long Context Reasoning (AA-LCR) benchmark earlier this week to test the reasoning capabilities of models across long sequence lengths (sets of documents ~100k tokens in total). GPT-5 stands out for its performance in AA-LCR, with GPT-5 in both High and Medium reasoning efforts topping the benchmark. 🤖 Agentic Capabilities: OpenAI also commented on improvements across capabilities increasingly important to how AI models are used, including agents (long horizon tool calling). We recently added IFBench to our Intelligence Index to cover instruction following and will be adding further evals to cover agentic tool calling to independently test these capabilities. 📡 Vibe checks: We’re testing the personality of the model through MicroEvals on our website which supports running the same prompt across models and comparing results. It’s free to use, we’ll provide an update with our perspective shortly but feel free to share your own! See below for further analysis:

205

21,382

Zhiqing Sun · Apr 24, 2025 · 8:48 PM UTC

Zhiqing Sun

@EdwardSun0909

24 Apr 2025

deep research mini is here 🔭 share your feedback with us!

OpenAI

@OpenAI

24 Apr 2025

Replying to @OpenAI

The lightweight version of deep research is powered by a version of OpenAI o4-mini and is nearly as intelligent as the deep research people already know and love, while being significantly cheaper to serve. Responses will typically be shorter while maintaining the depth and quality you’ve come to expect. Once limits for the original version of deep research are reached, queries automatically default to the lightweight version.

160

11,781

Zhiqing Sun · Feb 2, 2025 · 8:33 PM UTC

Zhiqing Sun

@EdwardSun0909

2 Feb 2025

🔭 Understand the Universe.

161

18,775

Zhiqing Sun · May 3, 2024 · 9:44 PM UTC

Zhiqing Sun

@EdwardSun0909

3 May 2024

⭐Self-Play Preference Optimization for Language Model Alignment⭐ arxiv.org/abs/2405.00675 Bradley-Terry models in RLHF fall short in capturing the intransitivity and irrationality in human preferences. How can we identify the Nash equilibrium policy with general preferences?🧵

152

26,993

Zhiqing Sun · Feb 2, 2025 · 9:33 PM UTC

Zhiqing Sun

@EdwardSun0909

2 Feb 2025

🔭🔭🔭

OpenAI

@OpenAI

2 Feb 2025

Deep Research Live from Tokyo 4pm PT / 9am JST Stay tuned for link to livestream.

133

21,034

Zhiqing Sun · Feb 20, 2025 · 4:14 AM UTC

Zhiqing Sun

@EdwardSun0909

20 Feb 2025

OK we need a good benchmark for all Deep Research-like products to quantitatively tell who’s the deepest researcher

Jimmy Ba

@jimmybajimmyba

20 Feb 2025

🔭 Understand the Universe in less than a min. Grok 3 DeepSearch

137

15,889

Zhiqing Sun · Mar 5, 2025 · 9:02 PM UTC

Zhiqing Sun

@EdwardSun0909

5 Mar 2025

GPT-4.5 surely memorizes lots of knowledge in its weights :)

123

11,694

Zhiqing Sun · Dec 14, 2023 · 5:02 AM UTC

Zhiqing Sun

@EdwardSun0909

14 Dec 2023

Haha, thanks! We used this template: s-ink.org/betterposter-poste…

‘Betterposter’ poster template

Templates for the 'Betterposter' poster design by Mike Morrison to effectively create effective scientific posters.

s-ink.org

Konstantin Mishchenko

@konstmish

14 Dec 2023

I wish every poster at every conference looked like this. The idea that all text should be same size is just wrong.

122

17,840

Zhiqing Sun · May 22, 2023 · 2:14 AM UTC

Zhiqing Sun

@EdwardSun0909

22 May 2023

Beating Alpaca or Davinci003 with only 1k samples is indeed impressive! Personally, I find myself in alignment with their Superficial Alignment Hypothesis, as in Self-Align, we have shown that a mere set of 16 rules is sufficient to outperform Alpaca or Davinci003!

@_akhaliq

22 May 2023

LIMA: Less Is More for Alignment LIMA, a 65B parameter LLaMa language model fine-tuned with the standard supervised loss on only 1,000 carefully curated prompts and responses, without any reinforcement learning or human preference modeling. LIMA demonstrates remarkably strong performance, learning to follow specific response formats from only a handful of examples in the training data, including complex queries that range from planning trip itineraries to speculating about alternate history. Moreover, the model tends to generalize well to unseen tasks that did not appear in the training data. In a controlled human study, responses from LIMA are either equivalent or strictly preferred to GPT-4 in 43% of cases; this statistic is as high as 58% when compared to Bard and 65% versus DaVinci003, which was trained with human feedback paper page: huggingface.co/papers/2305.1…

116

33,429

Zhiqing Sun · Oct 12, 2023 · 10:47 PM UTC

Zhiqing Sun

@EdwardSun0909

12 Oct 2023

📢 One detail we didn't spotlight earlier: Dromedary-2 might just be the world's best open-source, non-distilled LLM for commercial use! 🌍🚀 Here's a comparison with other baselines from the Humpback paper. Dromedary-2 notably pushes the boundaries on info extraction and math!

Zhiqing Sun

@EdwardSun0909

11 Oct 2023

120

39,191

Zhiqing Sun · Oct 13, 2023 · 6:41 PM UTC

Zhiqing Sun

@EdwardSun0909

13 Oct 2023

Absolutely thrilled to be a recipient of the 2023 Google PhD Fellowship! Deep gratitude to my advisors/mentors Yiming, Xuezhi, @denny_zhou , and all my dedicated collaborators. Also, thanks for the generous support from @GoogleAI @Google.

Google AI

@GoogleAI

13 Oct 2023

In 2009, Google created the PhD Fellowship Program to recognize and support outstanding graduate students pursuing exceptional research in computer science and related fields. Today, we congratulate the recipients of the 2023 Google PhD Fellowship! goo.gle/3PYfLXl

120

18,205

Zhiqing Sun · Dec 7, 2023 · 7:04 PM UTC

Zhiqing Sun

@EdwardSun0909

7 Dec 2023

I'm usually skeptical when people say DPO achieves similar results as PPO, especially as DPO models often stem from GPT-4, making it more like knowledge distillation. But now, my favorite project, alpacafarm, has just confirmed this w/o kd! Wow, definitely something real here!😱

This Post is from an account that no longer exists.

110

23,297

Zhiqing Sun · Apr 16, 2025 · 6:03 PM UTC

Zhiqing Sun

@EdwardSun0909

16 Apr 2025

One model, all tools

OpenAI

@OpenAI

16 Apr 2025

Introducing OpenAI o3 and o4-mini—our smartest and most capable models to date. For the first time, our reasoning models can agentically use and combine every tool within ChatGPT, including web search, Python, image analysis, file interpretation, and image generation.

106

6,619

Zhiqing Sun · May 5, 2023 · 6:56 PM UTC

Zhiqing Sun

@EdwardSun0909

5 May 2023

We developed Dromedary, a self-aligned AI agent with minimal human supervision!

@_akhaliq

5 May 2023

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision abs: arxiv.org/abs/2305.03047 paper page: huggingface.co/papers/2305.0… project page: mitibmdemos.draco.res.ibm.co…

22,449

Zhiqing Sun · Jan 20, 2024 · 9:58 PM UTC

Zhiqing Sun

@EdwardSun0909

20 Jan 2024

Our paper on ✨ Self-Aligning Language Models via RLAIF ✨ has been officially accepted at @iclr_conf 2024! We're thrilled to share our insights in Vienna. Stay tuned for self-aligning advancements in LLMs. #ICLR2024 See you there! 🌍🚀

Zhiqing Sun

@EdwardSun0909

11 Oct 2023

13,382

Zhiqing Sun · May 9, 2025 · 1:41 AM UTC

Zhiqing Sun

@EdwardSun0909

9 May 2025

Deep research in your own repos!

OpenAI Developers

@OpenAIDevs

8 May 2025

You can now connect GitHub repos to deep research in ChatGPT. 🐙 Ask a question and the deep research agent will read and search the repo’s source code and PRs, returning a detailed report with citations. Hit deep research → GitHub to get started.

6,793

Zhiqing Sun · Mar 27, 2025 · 4:46 PM UTC

Zhiqing Sun

@EdwardSun0909

27 Mar 2025

Excited to present with @isafulf tonight at the OpenAI Forum, introducing the research behind Deep Research! Join us at 6pm PT to explore how this new agentic capability in ChatGPT works. Register here:

Jason Kwon

@jasonkwon

25 Mar 2025

Replying to @jasonkwon

12,718

Zhiqing Sun · Apr 9, 2025 · 3:42 AM UTC

Zhiqing Sun

@EdwardSun0909

9 Apr 2025

high-taste testers yield high-taste takes

7,891

Zhiqing Sun · Apr 21, 2020 · 6:13 PM UTC

Zhiqing Sun

@EdwardSun0909

21 Apr 2020

"A Re-evaluation of Knowledge Graph Completion Methods" accepted to ACL 2020 #acl2020nlp . We performed an extensive re-examination study of recent neural network based KGC techniques. arxiv.org/abs/1911.03903 Joint work with @svjan5 , @ssanyal8 , @partha_p_t , and Yiming Yang

Zhiqing Sun · Apr 15, 2025 · 7:47 PM UTC

Zhiqing Sun

@EdwardSun0909

15 Apr 2025

Pretty sure we just dropped a benchmark for deep research agents 😬 openai.com/index/browsecomp/ Need a hand over here?

BrowseComp: a benchmark for browsing agents

BrowseComp: a benchmark for browsing agents.

openai.com

Anthropic

@AnthropicAI

15 Apr 2025

Today we’re launching Research, alongside a new Google Workspace integration. Claude now brings together information from your work and the web.

7,908

Zhiqing Sun · Sep 28, 2025 · 6:41 PM UTC

Zhiqing Sun

@EdwardSun0909

28 Sep 2025

Replying to @dhruv31415

I didn’t realize Aidan just unfollowed me for this 😅 I asked chatgpt to polish my wording

9,549

Zhiqing Sun · Feb 3, 2025 · 1:13 AM UTC

Zhiqing Sun

@EdwardSun0909

3 Feb 2025

🩵🩵🩵

Sam Altman

@sama

3 Feb 2025

congrats to the team, especially @isafulf and @EdwardSun0909, for building an incredible product. my very approximate vibe is that it can do a single-digit percentage of all economically valuable tasks in the world, which is a wild milestone.

9,057

Zhiqing Sun · Dec 10, 2024 · 2:31 AM UTC

Zhiqing Sun

@EdwardSun0909

10 Dec 2024

Excited to share the final work from my PhD: Easy-to-Hard Generalization at NeurIPS! Join me at my poster on Friday—happy to chat about reasoning, scalable alignment, and more. Bonus: we’ll also have an oral presentation at the MATH-AI workshop on the inference scaling law!

Sean Welleck

@wellecks

7 Nov 2024

Easy-to-Hard Generalization was accepted to NeurIPS! Congrats to @EdwardSun0909 and @scut_longhui! Check out the updated camera-ready version here: openreview.net/pdf?id=qwgfh2…

26,751

Zhiqing Sun · May 9, 2023 · 4:43 PM UTC

Zhiqing Sun

@EdwardSun0909

9 May 2023

Many people asked me about SELF-ALIGN vs. Constitutional AI (CAI). In short: CAI is self-critique: input ➡️ output ➡️ one rule ➡️ refined output SELF-ALIGN: input ➡️ self-chosen rules ➡️ output Thus, we're limited to 16 rules in our prompt, whereas CAI can have up to 58+ rules.

Anthropic

@AnthropicAI

9 May 2023

How does a language model decide which questions it will engage with and which it deems inappropriate? We use Constitutional AI to more directly encode values into our language models.

ALT Image of a scroll representing a constitution with a neural network design on it

11,158

Zhiqing Sun · Mar 31, 2025 · 8:06 PM UTC

Zhiqing Sun

@EdwardSun0909

31 Mar 2025

Open AI🫡

Sam Altman

@sama

31 Mar 2025

TL;DR: we are excited to release a powerful new open-weight language model with reasoning in the coming months, and we want to talk to devs about how to make it maximally useful: openai.com/open-model-feedba… we are excited to make this a very, very good model! __ we are planning to release our first open-weigh language model since GPT-2. we’ve been thinking about this for a long time but other priorities took precedence. now it feels important to do. before release, we will evaluate this model according out our preparedness framework, like we would for any other model. and we will do extra work given that we know this model will be modified post-release. we still have some decisions to make, so we are hosting developer events to gather feedback and later play with early prototypes. we’ll start in SF in a couple of weeks followed by sessions in europe and APAC. if you are interested in joining, please sign up at the link above. we’re excited to see what developers build and how large companies and governments use it where they prefer to run a model themselves.

4,874

Zhiqing Sun · Jul 18, 2025 · 5:03 PM UTC

Zhiqing Sun

@EdwardSun0909

18 Jul 2025

Another tip: it generates a real pptx file. So you can download the artifact, open it in microsoft powerpoint app, and apply the design you want to all of them!

Isa Fulford

@isafulf

18 Jul 2025

tip for chatgpt agent slides: first ask it to do the research only, then ask it to make the slides!

10,528

Zhiqing Sun · Jul 19, 2024 · 4:51 PM UTC

Zhiqing Sun

@EdwardSun0909

19 Jul 2024

Check our new work on improving neural theorem proving by giving LLMs more time to think before each tactic action! I think this is an important step towards fully exploiting LLMs’ reasoning power & agentic abolity in formal mathematics.💪

Sean Welleck

@wellecks

19 Jul 2024

How can informal reasoning improve formal theorem proving? New paper: "Lean-STaR: Learning to Interleave Thinking and Proving" arxiv.org/abs/2407.10040 We introduce a framework for learning to interleave informal thoughts with steps of formal proving. 46.3% on miniF2F 🔥

6,936

Zhiqing Sun · Aug 5, 2025 · 5:44 PM UTC

Zhiqing Sun

@EdwardSun0909

5 Aug 2025

🐐🐐🐐@xinw_ai @michiyasunaga @ren_hongyu

Angel 🌼

@Angaisb_

5 Aug 2025

Replying to @sama

Sam this performance is crazy

9,168

Zhiqing Sun · Feb 27, 2025 · 8:08 PM UTC

Zhiqing Sun

@EdwardSun0909

27 Feb 2025

Feel the largest model vibe 🚀

OpenAI

@OpenAI

27 Feb 2025

GPT-4.5 has entered the Chat. openai.com/live/

5,740

Zhiqing Sun · Jul 8, 2023 · 8:24 PM UTC

Zhiqing Sun

@EdwardSun0909

8 Jul 2023

En route to #ACL2023!🤟 No submissions from me this time, but I'm all set for exciting poster chats and casual networking! (all thanks to LTI financial support) Feel free to DM me if you’d like to chat about Self-Alignment (Scalable Oversight) about LLMs

4,517

Zhiqing Sun · Jul 22, 2023 · 9:58 AM UTC

Zhiqing Sun

@EdwardSun0909

22 Jul 2023

En route to #ICML2023 at Hawaii 🏝️! This time I’ll present a main conference paper on neural PDE solving and a workshop paper on neural combinatorial optimization solving. I’m also happy to share my thoughts on (my recent research on) LLM Alignment. Feel free to DM me!

5,870

Zhiqing Sun · Jul 10, 2025 · 5:17 AM UTC

Zhiqing Sun

@EdwardSun0909

10 Jul 2025

no we go back to 1 in 2024

Eric Zelikman

@ericzelikman

10 Jul 2025

ALT ai model version numbers over time

7,780

Zhiqing Sun · Mar 12, 2025 · 9:13 AM UTC

Zhiqing Sun

@EdwardSun0909

12 Mar 2025

I blame @MistralAI for being the first to make this kind of confusing diagam 😅

Alexandre Ramé @ramealexandre

12 Mar 2025

Welcome Gemma 3, our new open-weight LLM from @GoogleDeepMind. All sizes (1B, 4B, 12B and 27B) excel on benchmarks, but the key result may be the 27B reaching 1338 on LMSYS. For this, we scaled post-training, with our novel distillation, RL and merging strategies. Happy building!

7,108

Zhiqing Sun · Dec 14, 2024 · 5:49 AM UTC

Zhiqing Sun

@EdwardSun0909

14 Dec 2024

what

Jiao Sun

@sunjiao123sun_

14 Dec 2024

Mitigating racial bias from LLMs is a lot easier than removing it from humans! Can’t believe this happened at the best AI conference @NeurIPSConf We have ethical reviews for authors, but missed it for invited speakers? 😡

3,779

Zhiqing Sun · Mar 2, 2025 · 6:23 PM UTC

Zhiqing Sun

@EdwardSun0909

2 Mar 2025

🫡🫡🫡

Sam Altman

@sama

2 Mar 2025

Replying to @BLCNYY @OpenAI

we are working to make it much more efficient and then will offer much higher usage limits in the meantime, please send me your email address

6,743

Zhiqing Sun · Jul 29, 2024 · 10:00 PM UTC

Zhiqing Sun

@EdwardSun0909

29 Jul 2024

Check out our winning formula in AIMO with only $1000 budget😎 Amazing work by @WYZ0402 in only 1 post-NeurIPS month💪

ML@CMU @mlcmublog

29 Jul 2024

🔥Our CMU-MATH team proudly clinched 2nd place in the Artificial Intelligence Mathematical Olympiad (AIMO) out of 1,161 participating teams with the best performance of an academic team! Dive into our blog to discover our winning formula: blog.ml.cmu.edu/2024/07/29/c…

5,636

Zhiqing Sun · Mar 26, 2025 · 2:40 AM UTC

Zhiqing Sun

@EdwardSun0909

26 Mar 2025

Congrats!!

Gabriel Goh

@gabeeegoooh

25 Mar 2025

this is ready now for the world

3,330

Zhiqing Sun · Jul 25, 2025 · 8:58 PM UTC

Zhiqing Sun

@EdwardSun0909

25 Jul 2025

Replying to @shengjia_zhao

Congrats!

8,505

Zhiqing Sun · Jul 17, 2025 · 6:15 PM UTC

Zhiqing Sun

@EdwardSun0909

17 Jul 2025

Great work!!

Xikun Zhang 张熙堃

@xikun_zhang_

17 Jul 2025

Just launched ChatGPT Agent (sorry GPT-5 waiters, it is coming!), the most capable AI agent model to date! It has been such an honor to be part of a crazy sprint to get this amazing model trained and shipped together with an absolutely gem team (@isafulf , @caseychu9 , @EdwardSun0909 , @josh_tobin_ Yash Kumar and many more)! I am so proud of this project, so I want to share some highlights, personal takes and lessons learned while working on it: 1. Used for research 📕 + actions 💻 + slides generation: Deep Research can do research. Operator can take actions for you. ChatGPT Agent can do both at the same time! E.g. you can ask it to make a plan for a trip to Hawaii, find good deals on hotels and flights, and book them on your behalf using its own computer! It can also generate slides! 2. Power of end-to-end RL: How do we build it? You guess it right! It is us, @OpenAI RL diehards. You are probably tired of hearing about RL scaling. Me, too. But when I feel its power first-hand, its effectiveness and data efficiency still shock me and feel like magic 🪄. 3. First OpenAI model of high biorisk 💀: Not sure this is something I should proud of or not :) For an ex-AI bio PhD researcher like me, this is something a bit personal. One one hand, many of my biomedicine researcher friends tell me that AI agents have significantly helped with their research. On the other hand, such a capable model can amplify the risk of malicious actors building bioweapons. Our safety team has done incredible work to mitigate the risks. 4. Collaboration with users 👪 is core: We want our AI to augment and enhance humans, not to replace them, so we work hard to make the model good at collaborating with the user. You can type a message at anytime to interrupt it and steer it to new directions. The model will always confirm with you before taking actions like buying things for you or deleting a file on your google drive. And the model will ask clarification questions only when it needs more clarity from you! 5. How to generate good slides: As in other cases, writing a well-specified prompt always helps! Also try first telling it to generate a report, then convert the report into slides! 6. Real-world performance > benchmark chasing: One thing outside people may not know about us is how little attention we pay to external benchmarks during the model dev process. We do not focus on hill-climbing on them, and we do not care that much about how we end up on the leaderboard. That said, as a byproduct of our pursuit to great real-world performance and true intelligence, ChatGPT Agent does crush many benchmarks! Wanna learn more? Read our blog linked in the end! In the end, I want to shout out to my amazing team again. These extremely talented and kind people are the reason why OpenAI is constantly making magic like this! ❤️ Also please try ChatGPT Agent and give us feedback! You can reply here in the thread or my DM is open. This is just the start. We will continue working hard towards more and more capable super-human AI agents! 🤖 openai.com/index/introducing…

3,956

Zhiqing Sun · Apr 16, 2020 · 5:03 PM UTC

Zhiqing Sun

@EdwardSun0909

16 Apr 2020

I'm excited to announce "MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices" accepted to ACL 2020. Joint work with researchers from @LTIatCMU @GoogleAI Paper: arxiv.org/abs/2004.02984 Code & Pretrained Model: github.com/google-research/g… [1/4]

Zhiqing Sun · Dec 14, 2024 · 7:37 PM UTC

Zhiqing Sun

@EdwardSun0909

14 Dec 2024

The new era of post-training has arrived. Join us!

Jason Wei

@_jasonwei

13 Dec 2024

Yall heard it from the man himself

4,885

Zhiqing Sun · Dec 8, 2023 · 7:16 PM UTC

Zhiqing Sun

@EdwardSun0909

8 Dec 2023

Next week, I'm thrilled to be at #NeurIPS in New Orleans! Along with my co-authors, we're showcasing our recent works: DIFUSCO (arxiv.org/abs/2302.08224) and Self-Align (arxiv.org/abs/2305.03047). Can't wait to reconnect with familiar faces and meet new ones. See you there! 🌟✨🤝

DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization

Neural network-based Combinatorial Optimization (CO) methods have shown promising results in solving various NP-complete (NPC) problems without relying on hand-crafted domain knowledge. This paper...

arxiv.org

4,420

Zhiqing Sun · Sep 12, 2024 · 5:16 PM UTC

Zhiqing Sun

@EdwardSun0909

12 Sep 2024

🍓🍓🍓

OpenAI

@OpenAI

12 Sep 2024

We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. openai.com/index/introducing…

3,237

Zhiqing Sun · May 8, 2024 · 7:26 PM UTC

Zhiqing Sun

@EdwardSun0909

8 May 2024

How to steer the model’s behavior in a scalable manner so that we can control how we want these models to behave? Perhaps try Principle-Driven (Self-)Alignment! We have a series of work such as Self-Align & SALMON covering context distillation & RLAIF🤩🤩🤩

OpenAI

@OpenAI

8 May 2024

To deepen the public conversation about how AI models should behave, we’re sharing our Model Spec — our approach to shaping desired model behavior. openai.com/index/introducing…

5,909

Zhiqing Sun · May 8, 2024 · 9:54 PM UTC

Zhiqing Sun

@EdwardSun0909

8 May 2024

How to technically realize @OpenAI Model Spec based on any set of human-defined principles? Discover why an Instructable Reward Model is all you need at our SALMON poster session #ICLR tmr, presented by the brilliant @QinhongZhou. 📅 Thurs, May 9, 10:45 AM CEST 📍 Halle B #7

Zhiqing Sun

@EdwardSun0909

11 Oct 2023

4,183

Zhiqing Sun · May 13, 2023 · 9:17 PM UTC

Zhiqing Sun

@EdwardSun0909

13 May 2023

Check our new work!

John Nay

@johnjnay

12 May 2023

Active LLM Retrieval Augmented Generation -Iteratively uses a prediction of upcoming sentence to anticipate future content which is used as query to retrieve relevant docs to regenerate sentence -On 4 long-form generation tasks: superior / competitive arxiv.org/abs/2305.06983

5,558

Zhiqing Sun · Aug 7, 2024 · 9:05 PM UTC

Zhiqing Sun

@EdwardSun0909

7 Aug 2024

Very beautiful scaling plot😍

Sean Welleck

@wellecks

7 Aug 2024

Replying to @wellecks

We study common inference strategies (e.g., majority voting, MCTS) and a new tree search, along with various model sizes. First, we hold the inference strategy fixed, and find that across different model sizes, smaller models typically have better accuracy-cost tradeoffs

5,565

Zhiqing Sun · Jun 26, 2025 · 7:02 PM UTC

Zhiqing Sun

@EdwardSun0909

26 Jun 2025

🥹

OpenAI Developers

@OpenAIDevs

26 Jun 2025

Replying to @OpenAIDevs

o3-deep-research: platform.openai.com/docs/mod… o4-mini-deep-research: platform.openai.com/docs/mod… These models are the same post-trained o3 and o4-mini models that power deep research in ChatGPT. They also support MCP (search/fetch) and Code Interpreter.

3,583

Zhiqing Sun · May 17, 2025 · 2:06 AM UTC

Zhiqing Sun

@EdwardSun0909

17 May 2025

🍳

Greg Brockman

@gdb

17 May 2025

2025 is the year of agents.

4,190

Zhiqing Sun · Feb 22, 2023 · 8:37 PM UTC

Zhiqing Sun

@EdwardSun0909

22 Feb 2023

We propose a new paradigm called RECITation-augmented gEneration (RECITE) that helps Large Language Models (LLMs) generate accurate factual knowledge by reciting relevant passages from their own memory before producing final answers. 2/N

1,338

Zhiqing Sun · Dec 7, 2023 · 7:32 AM UTC

Zhiqing Sun

@EdwardSun0909

7 Dec 2023

Combinatorial optimization (CO) problems are essential in many fields like operation research / software engineering / algorithm theory. We introduce a new paradigm to tackle CO problems with diffusion models. Accepted at NeurIPS as Spotlight! arxiv.org/abs/2302.08224 w/ Yiming

DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization

Neural network-based Combinatorial Optimization (CO) methods have shown promising results in solving various NP-complete (NPC) problems without relying on hand-crafted domain knowledge. This paper...

arxiv.org

1,583

Zhiqing Sun · Aug 15, 2025 · 7:51 AM UTC

Zhiqing Sun

@EdwardSun0909

15 Aug 2025

Replying to @clu_cheng

what kind of tears 😏

3,732

Zhiqing Sun · Oct 13, 2023 · 9:21 PM UTC

Zhiqing Sun

@EdwardSun0909

13 Oct 2023

If I were given one hour to build an #AGI system, I would spend 59 minutes defining the principles it should follow, and one minute clicking the training button 🔥🔥🔥🔥 github.com/IBM/SALMON/blob/m…

Denny Zhou

@denny_zhou

13 Oct 2023

“If I were given one hour to save the planet, I would spend 59 minutes defining the problem and one minute resolving it.” — Albert Einstein

15,413

Zhiqing Sun · May 3, 2024 · 9:44 PM UTC

Zhiqing Sun

@EdwardSun0909

3 May 2024

The code and model weights will be released soon. We found there is a community implementation of the hard prob version of SPPO in TRL. We have submitted a PR to fix some bugs: github.com/huggingface/trl/p… Please note that the iterative SPPO results in our paper use soft probs.

corrects loss function for Self-play Preference Optimization hard label version by angelahzyuan ·...

Corrects implementation mentioned in #1612. arxiv: https://arxiv.org/abs/2405.00675. This updates the loss function according to Equation (4.8) with $P(y_w > y_l) = 1$ and $P(y_l &am...

github.com

5,850

Zhiqing Sun · Jul 25, 2024 · 3:31 AM UTC

Zhiqing Sun

@EdwardSun0909

25 Jul 2024

Replying to @lilianweng

👀 might be related:

Zhiqing Sun

@EdwardSun0909

11 Oct 2023

2,358

Zhiqing Sun · May 9, 2023 · 3:44 AM UTC

Zhiqing Sun

@EdwardSun0909

9 May 2023

Totally feel you🤪. You might find our new preprint interesting – it's about aligning LLMs from scratch! arxiv.org/abs/2305.03047

Principle-Driven Self-Alignment of Language Models from Scratch...

Recent AI-assistant agents, such as ChatGPT, predominantly rely on supervised fine-tuning (SFT) with human annotations and reinforcement learning from human feedback (RLHF) to align the output of...

arxiv.org

Jing Yu Koh

@kohjingyu

9 May 2023

Genuine question: What's the (scientific) value of these recent papers that finetune a smaller LM on GPT-4 outputs? It's obviously useful to have a smaller LM that performs ≈ GPT-4 in specific settings. But I don't see the value in packaging into a paper and flooding arXiv.

3,601

Zhiqing Sun · Feb 3, 2025 · 12:59 AM UTC

Zhiqing Sun

@EdwardSun0909

3 Feb 2025

Replying to @ren_hongyu

♣️🫵🐮!

6,404

Zhiqing Sun · Oct 30, 2025 · 2:04 AM UTC

Zhiqing Sun

@EdwardSun0909

30 Oct 2025

Congrats!

Yash Patil

@ypatil125

29 Oct 2025

Today, @rhythmrg, @lindensli and I are introducing @appliedcompute. We’re building Specific Intelligence for the enterprise. Achieving SOTA today means specialization in both human and machine talent. We’ve spent the last six months working with companies like @cognition, @DoorDash, and @mercor_ai, unlocking their company knowledge to build custom agent workforces that outperform frontier models at specific tasks. My cofounders and I all worked on different parts of this problem while at OpenAI, from Codex to o1 to the ML systems and infrastructure for RL training. Two-thirds of our team (see below!) are former founders, and everyone brings a deep technical background, from top AI researchers to Math Olympiad winners. We’ve raised $80M from @benchmark, @sequoia, @Lux_Capital, @eladgil, @victoralazarte, and @Casspi18, and we’re hiring across engineering and research.

9,867

Zhiqing Sun · Jul 23, 2024 · 6:20 PM UTC

Zhiqing Sun

@EdwardSun0909

23 Jul 2024

Replying to @zdhnarsil

try this: arxiv.org/abs/2302.08224

DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization

Neural network-based Combinatorial Optimization (CO) methods have shown promising results in solving various NP-complete (NPC) problems without relying on hand-crafted domain knowledge. This paper...

arxiv.org

2,387

Zhiqing Sun · Dec 11, 2019 · 12:01 AM UTC

Zhiqing Sun

@EdwardSun0909

11 Dec 2019

Curious about how to improve non-autoregressive models with conditional random fields? How to deal with extremely large vocabulary size in CRF for machine translation? Come to our poster at Wednesday evening at East hall #109 at #NeurIPS2019 @NeurIPSConf @zhuohan123 @suzzzylin

Zhiqing Sun · Oct 23, 2023 · 11:15 PM UTC

Zhiqing Sun

@EdwardSun0909

23 Oct 2023

Very interesting observation by @AnthropicAI on AIs often producing 'sycophantic' responses to appease users. Curious if the RLAIF could address this. Maybe a new principle under SALMON's RL-Time Preference Intervention could be "Maintain integrity, avoid sycophancy"? 🤔

Anthropic

@AnthropicAI

23 Oct 2023

AI assistants are trained to give responses that humans like. Our new paper shows that these systems frequently produce ‘sycophantic’ responses that appeal to users but are inaccurate. Our analysis suggests human feedback contributes to this behavior.

Left: Text “Towards Understanding Sycophancy in Language Models,” Mrinank Sharma*, Meg Tong* et al. Right: A black and white image of mountains and a lake.

ALT Left: Text “Towards Understanding Sycophancy in Language Models,” Mrinank Sharma*, Meg Tong* et al. Right: A black and white image of mountains and a lake.

2,172

Zhiqing Sun · Oct 27, 2023 · 10:00 PM UTC

Zhiqing Sun

@EdwardSun0909

27 Oct 2023

Scaling laws for DPO is still unproven ==> I wonder if the problem is the scaling laws. To me, the primary concern of DPO is that it's only proven effective when we have high-quality (or distilled) demonstrations as positive examples. Similar observation:

Nathan Lambert

@natolambert

27 Oct 2023

The Zephyr-beta model from @huggingface H4 (led by @_lewtun and @edwardbeeching these days) is a great example of engineering practices and know how slowly kicking into gear for RLHF. Some takeaways beyond "high MT Bench and AlpacaEval scores": * DPO can work great for smaller models. This is huge for open-ML as small specialized models are the future. People need to try DPO on specialized application feedback datasets! * Long-term engineering investment can pay off on unexpected timelines. I left a week or two before Zephyr, and we didn't even have it on the plan yet. Finding the right dataset and plugging it into the pipeline can be everything. * Scaling laws for DPO is still unproven. Lots and lots of RLHF experts are skeptical of it for larger models. I think there may need a slight change of the loss function for stability (and maybe sample efficiency is lower), which is why the H4 team found success in multiple Epochs. * MT Bench / Alpaca Eval are pretty saturated. Next year, your model is going to need to get 7+ on MTBench to be considered, but above that may not matter. It's getting normalized, but we need more eval tools still. *AI Feedback is extremely broad. For this model, it was data curation. I expect it to also work for filtering and more. * Releasing both SFT and RL checkpoints with data is great for replication (as the team did). Excited to see where this goes next! Paper: arxiv.org/abs/2310.16944 Artifacts: huggingface.co/collections/H…

2,117

Zhiqing Sun · Mar 21, 2024 · 2:41 PM UTC

Zhiqing Sun

@EdwardSun0909

21 Mar 2024

Building on the idea that “evaluation is easier than generation”, we find that strong reward models trained on easy data facilitate easy-to-hard generalization via reranking or reinforcement learning. (3/n)

1,612

Zhiqing Sun · Mar 17, 2025 · 9:40 PM UTC

Zhiqing Sun

@EdwardSun0909

17 Mar 2025

🥲🥲🥲

Liam Fedus

@LiamFedus

17 Mar 2025

This is what I sent to my colleagues at OpenAI: Hi all, I made the difficult decision to leave OpenAI as an employee, but I’m looking to work closely together as a partner going forward. Contributing to the mission of OpenAI and working with world-class teams to create and improve ChatGPT has been an experience of a lifetime. But I’ve gotten really excited about AI for science. My undergrad was in physics and I’m keen to apply this technology there. Because AI for science is one of the most strategically important areas to OpenAI and achieving ASI, OpenAI is planning to invest in and partner with my new company. So I’ll see you all around! Thanks to all the leadership who believed in me early on, especially, Sam, Greg, and Mark. Thank you everyone on post-training and to all of our collaborators across research and product. I’ll miss working with so many of you, but will be cheering you on! Post-training has an amazing roster of talent and leaders who will continue to drive its success.

3,525

Zhiqing Sun · Nov 7, 2025 · 3:02 AM UTC

Zhiqing Sun

@EdwardSun0909

7 Nov 2025

Replying to @rogerw0108 @zhuohan123

no we’ll achieve the real agi and serve it with vllm 😎

1,835

Zhiqing Sun · May 23, 2023 · 8:31 PM UTC

Zhiqing Sun

@EdwardSun0909

23 May 2023

Check out our active retrieval augmented generation method that actively decides when and what to retrieve!

Luyu Gao @luyu_gao

23 May 2023

[1/4] Large language models (LLMs) tend to hallucinate, especially when generating long outputs. We present active retrieval augmented generation, in which an LLM actively decides when and what to retrieve throughout the generation process.

2,824

Zhiqing Sun · Feb 2, 2025 · 10:11 PM UTC

Zhiqing Sun

@EdwardSun0909

2 Feb 2025

Replying to @ibab

The stars will align at 4 pm 🫡

791

Zhiqing Sun · Dec 30, 2023 · 8:39 AM UTC

Zhiqing Sun

@EdwardSun0909

30 Dec 2023

Nice work! 👏 But when it comes to extreme data-efficiency, I guess our Dromedary takes the lead in 2023! We've achieved a MT-Bench score of 7.37 and a 88.32 AlpacaEval with just **6** (no K) SFT samples. The secret sauce lies in our Self-Align and RLAIF. arxiv.org/abs/2310.05910

Junxian He @junxian_he

28 Dec 2023

💡 We release methods and datasets for extremely data-efficient alignment 🚀 6K SFT samples lead to 7.22 MT-Bench score, further DPO with 10K samples achieve 7.55 MT-Bench +90+% AlpacaEval Try our data to align models more efficiently: github.com/hkust-nlp/deita

1,909

Zhiqing Sun · May 9, 2023 · 3:24 PM UTC

Zhiqing Sun

@EdwardSun0909

9 May 2023

Replying to @generatorman_ai

We tried the same self-align prompt (step2 in our paper) on the LLaMA-7b and GPT-NeoX-20B models, but their performance did not match that of the 65b model. So I believe that the principle-driven self-align method only works for models that are powerful enough, though (cont.)

4,594

Zhiqing Sun · Jun 15, 2023 · 7:49 PM UTC

Zhiqing Sun

@EdwardSun0909

15 Jun 2023

Hi Yizhong, you classified the instr.-following datasets into: 1) existing NLP datasets 2) written by humans from scratch 3) generated by proprietary models 4) user-shared. Have you considered "generated by OSS models", like applying Self-Instruct or Self-Align to base LLaMA?

Yizhong Wang @yizhongwyz

9 Jun 2023

🦙🐪🐫 So many instruction tuning datasets came out recently! How valuable are they, and how far are open models really from proprietary ones like ChatGPT? 🧐We did a systematic exploration, and built Tülu---a suite of LLaMa-tuned models up to 65B! 📜arxiv.org/abs/2306.04751

3,390

Zhiqing Sun · Mar 21, 2024 · 2:41 PM UTC

Zhiqing Sun

@EdwardSun0909

21 Mar 2024

We study this in terms of easy-to-hard generalization. This is a conceptual analogy of superalignment (weak-to-strong generalization). Instead of letting strong models learn from weak teachers, we let models generalize to problems more difficult than those seen during training.

2,359

Zhiqing Sun · Apr 3, 2024 · 5:50 PM UTC

Zhiqing Sun

@EdwardSun0909

3 Apr 2024

Check out the new work on aligning Video LMMs with factually-enhanced DPO!🐕 🐕 🐕

Ruohong Zhang

@RuohongZhang

3 Apr 2024

[p1] 🐕Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward🐕 Paper link: arxiv.org/pdf/2404.01258.pdf… page: github.com/RifleZhang/LLaVA-… How to effectively train video large multimodal Model (LMM) alignment with preference modeling?

2,369

Zhiqing Sun · Oct 11, 2023 · 5:26 PM UTC

Zhiqing Sun

@EdwardSun0909

11 Oct 2023

Inspired by Asimov's three laws of robotics, we envision a future where a few general principles can be internalized by AI systems. This aligns with recent advances in self-alignment, aiming for models to improve themselves with minimal human supervision. 2/N

1,912

Zhiqing Sun · Oct 13, 2023 · 8:51 PM UTC

Zhiqing Sun

@EdwardSun0909

13 Oct 2023

Replying to @natolambert

Excited to share our recent work: SALMON (Self-ALignMent with principle-fOllowiNg reward models) - minimizing dependency on human annotations for aligning LLM-based AI agents, through principle-following reward models. Would love to be featured! arxiv.org/abs/2310.05910

SALMON: Self-Alignment with Instructable Reward Models

Supervised Fine-Tuning (SFT) on response demonstrations combined with Reinforcement Learning from Human Feedback (RLHF) constitutes a powerful paradigm for aligning LLM-based AI agents. However, a...

arxiv.org

4,479

Zhiqing Sun · Dec 1, 2022 · 9:43 PM UTC

Zhiqing Sun

@EdwardSun0909

1 Dec 2022

Come check our paper at #NeurIPS22 Now! Poster Session 4-6 pm Booth #128 DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems bytez.com/read/neurips/54442 #NeurIPS2022 #bytez #friendly-papers via @bytez

Zhiqing Sun · Mar 21, 2024 · 2:41 PM UTC

Zhiqing Sun

@EdwardSun0909

21 Mar 2024

Heartfelt thanks to our exceptional team for their collaborative spirit and invaluable insights. @EdwardSun0909 @scut_longhui @Yikang_Shen @Besteuler Yiming Yang @wellecks @gan_chuang Our code and model are open-sourced: github.com/Edward-Sun/easy-t…

GitHub - Edward-Sun/easy-to-hard: Easy-to-Hard Generalization: Scalable Alignment Beyond Human...

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision - Edward-Sun/easy-to-hard

github.com

1,117

Zhiqing Sun · Feb 22, 2023 · 8:37 PM UTC

Zhiqing Sun

@EdwardSun0909

22 Feb 2023

Joint work w/ Xuezhi Wang, @YiTayML, Yiming Yang, and @denny_zhou (@LTIatCMU and @GoogleAI). Our code is available at github.com/Edward-Sun/RECITE. 7/N

GitHub - Edward-Sun/RECITE: Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI

Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI - Edward-Sun/RECITE

github.com

777

Zhiqing Sun · Mar 21, 2024 · 2:41 PM UTC

Zhiqing Sun

@EdwardSun0909

21 Mar 2024

BTW, we are not the first to study the easy-to-hard scenario. Concurrent study by Hase et al. (2024) backs training on easy tasks as a strong baseline for ARC & MMLU. Our work on the harder MATH dataset shows reranking & RL have even better generalization than ICL & SFT. (6/n)

1,232

Zhiqing Sun · Dec 7, 2023 · 11:08 PM UTC

Zhiqing Sun

@EdwardSun0909

7 Dec 2023

Replying to @billyuchenlin @_albertgu @tri_dao

Interesting work! Have you tried our magic Self-Align prompt?🧐 We also used some kind of ICL but uses an additional explicit principle-following step: Re-align: raw.githubusercontent.com/Re… Self-align: raw.githubusercontent.com/IB…

1,225