Tsinghua KEG (THUDM) · Aug 14, 2024 · 3:56 AM UTC

Tsinghua KEG (THUDM)

Pinned Tweet

Tsinghua KEG (THUDM)

@thukeg

14 Aug 2024

#VisualAgentBench: 4o, 4o-mini, 3.5-sonnet currently have an edge as visual foundation agents for now, but open models InternVL & GLM-4V are catching up fast, a similar story to LLMs as agents as revealed in #AgentBench back in Aug 2023. arxiv.org/pdf/2408.06327 github.com/THUDM/VisualAgent…

8,863

Tsinghua KEG (THUDM) · Aug 4, 2022 · 6:15 PM UTC

Tsinghua KEG (THUDM)

@thukeg

4 Aug 2022

Introducing GLM-130B, an open bilingual (English&Chinese) language model with 130 billion parameters, designed for users with a single A100 (40G*8) or V100 (32G*8) server. #GLM130 #LLM #GPT3 Learn more & download: keg.cs.tsinghua.edu.cn/glm-1… github.com/THUDM/GLM-130B

374

Tsinghua KEG (THUDM) · Mar 14, 2023 · 3:49 PM UTC

Tsinghua KEG (THUDM)

@thukeg

14 Mar 2023

ChatGLM-6B & ChatGLM! ChatGLM-6B is an open CN&EN model w/ 6.2B paras (optimized for Chinese QA & dialogue for now). Trained for 1T tokens, SFT, Feedback Bootstrap, & RLHF. w INT4 quantization, we can deploy on one 2080Ti card (6GB GPU mem required). github.com/THUDM/ChatGLM-6B/…

276

110,199

Tsinghua KEG (THUDM) · Sep 20, 2022 · 1:37 PM UTC

Tsinghua KEG (THUDM)

@thukeg

20 Sep 2022

Introducing #CodeGeeX, an open 13B multilingual code generative model capable of 1) generation of 15 programming languages & 2) translating btw on one click! Learn more: keg.cs.tsinghua.edu.cn/codeg… Search "codegeex" in #VSCode FOR FREE! DEMO: models.aminer.cn/codegeex/pl… Results next:

250

Tsinghua KEG (THUDM) · Oct 16, 2023 · 5:51 PM UTC

Tsinghua KEG (THUDM)

@thukeg

16 Oct 2023

#CogVLM: open vision language models - deep fusion btn LLM & image encoder w/ visual experts! CogVLM-17B tops 14 cross-modal benchs, beats #BLIP2, #PaLI-17B/X-55B, #PaLM-E-84B. Paper: github.com/THUDM/CogVLM HF🤗: huggingface.co/THUDM/CogVLM/… @_akhaliq @osanseviero @huggingface

243

81,577

Tsinghua KEG (THUDM) · Oct 10, 2022 · 12:31 PM UTC

Tsinghua KEG (THUDM)

@thukeg

10 Oct 2022

GLM-130B reaches INT4 quantization w/ no perf degradation, allowing effective inference on 4*3090 or 8*2080 Ti GPUs, the most ever affordable GPUs required for using 100B-scale models? Paper: arxiv.org/abs/2210.02414 Model weights & code & demo & lessons: github.com/THUDM/GLM-130B

235

Tsinghua KEG (THUDM) · Mar 23, 2023 · 12:10 PM UTC

Tsinghua KEG (THUDM)

@thukeg

23 Mar 2023

ChatGLM-6B is on the top spot of the 🤗 7-day trending? 220K downloads in a few days, and counting! A big thank you to @huggingface @_akhaliq @osanseviero @yvrjsharma @ClementDelangue and everyone! HF: huggingface.co/THUDM/chatglm… GH: github.com/THUDM/ChatGLM-6B

213

49,682

Tsinghua KEG (THUDM) · Apr 11, 2023 · 12:44 AM UTC

Tsinghua KEG (THUDM)

@thukeg

11 Apr 2023

LoRA & P-Tuning-v2 & other supports added for ChatGLM-6B. Much fewer resources required for tuning your own models! The open-source community is the lead! Check out those efforts here: github.com/THUDM/ChatGLM-6B/…

ChatGLM-6B/README_en.md at main · zai-org/ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型 - zai-org/ChatGLM-6B

github.com

192

39,264

Tsinghua KEG (THUDM) · Jul 25, 2023 · 8:41 AM UTC

Tsinghua KEG (THUDM)

@thukeg

25 Jul 2023

#CodeGeeX2-6B: 2nd gen. code generation model for 100+ lang. Free #VSCode & #JetBrains plugins (10M lines gen-ed per day). It is based on #ChatGLM2-6B with extra 600B code training. huggingface.co/THUDM/codegee… @huggingface @_akhaliq @osanseviero @ClementDelangue @code @kdd_news

142

46,988

Tsinghua KEG (THUDM) · Oct 2, 2022 · 1:59 AM UTC

Tsinghua KEG (THUDM)

@thukeg

2 Oct 2022

Thanks! The model weights & source code are freely available at keg.cs.tsinghua.edu.cn/codeg… Search "codegeex" in #VSCode FOR FREE!!

@_akhaliq

30 Sep 2022

The @Gradio Demo for CodeGeeX, a large-scale multilingual code generation model with 13 billion parameters, pre-trained on a large code corpus of more than 20 programming languages is now on @huggingface Spaces demo: huggingface.co/spaces/THUDM/…

145

Tsinghua KEG (THUDM) · Nov 8, 2023 · 1:02 PM UTC

Tsinghua KEG (THUDM)

@thukeg

8 Nov 2023

How to #RLHF for LLMs: #PPO or #DPO? Introducing #BPO (black-box prompt optimization) to align LLMs without model training. 1) ChatGPT + BPO > ChatGPT 2) GPT-4 + BPO > GPT-4 3) Vicuna + BPO > Vicuna + PPO/DPO 4) Vicuna + DPO + BPO > Vicuna + DPO arxiv.org/pdf/2311.04155v1.p…

153

30,493

Tsinghua KEG (THUDM) · Oct 16, 2023 · 5:58 PM UTC

Tsinghua KEG (THUDM)

@thukeg

16 Oct 2023

CogVLM: vision language models w/ visual experts that tops 14 cross-modal benchmarks. Paper, model, code @HuggingFace🤗: huggingface.co/THUDM/CogVLM/…

Felix

@felix_red_panda

16 Oct 2023

List of contemporary GPT4V-like open source models - Blip2 - Llava1.5 - CogVLM by @thukeg - StableVL by @vilm_hq (not out yet) - @skunkworks_ai 's model (not out yet) Did I miss any?

138

44,290

Tsinghua KEG (THUDM) · Oct 30, 2023 · 3:10 PM UTC

Tsinghua KEG (THUDM)

@thukeg

30 Oct 2023

OPEN #ChatGLM3-6B: the 3rd gen 1) tops 44 tasks among <10B models 2) supports tool/function call, code interpreter, agent tasks, 32K github.com/THUDM/ChatGLM3 🤗huggingface.co/THUDM/chatglm… #ChatGLM-6Bs: 10M 🤗downloads, thank YOU! @huggingface @ClementDelangue @_akhaliq @osanseviero

119

47,084

Tsinghua KEG (THUDM) · May 18, 2023 · 6:13 AM UTC

Tsinghua KEG (THUDM)

@thukeg

18 May 2023

#ChatGLM-6B v1.1: improved *English* dialogue & translation ability, in addition to Chinese. => #VisualGLM-6B: a multimodal conversational language model that supports 1) images, 2) Chinese, and 3) English w/ #ChatGLM-6B & BLIP2-Qformer. github.com/THUDM/ChatGLM-6B

31,950

Tsinghua KEG (THUDM) · Jul 31, 2023 · 4:40 PM UTC

Tsinghua KEG (THUDM)

@thukeg

31 Jul 2023

#ChatGLM2-6B-32K: better understands long texts based on the ChatGLM2-6B, up to 32K context length based on Positional Interpolation, trained with a 32K context length during dialogue alignment. Download🤗 @huggingface huggingface.co/THUDM/chatglm… github.com/THUDM/ChatGLM2-6B @_akhaliq

16,650

Tsinghua KEG (THUDM) · Nov 8, 2023 · 12:52 PM UTC

Tsinghua KEG (THUDM)

@thukeg

8 Nov 2023

Thanks AK! #CogVLM: an #open visual language model. Chat w CogVLM about images @Gradio github.com/THUDM/CogVLM CogVLM-17B tops 10+ benches: NoCaps, Flicker30k, RefCOCO, RefCOCO+, RefCOCOg, Visual7W, GQA, SciQA, VizWiz VQA & TDIUC, surpassing or matching PaLI-X 55B.

GitHub - zai-org/CogVLM: a state-of-the-art-level open visual language model | 多模态预训练模型

a state-of-the-art-level open visual language model | 多模态预训练模型 - zai-org/CogVLM

github.com

@_akhaliq

7 Nov 2023

CogVLM: Visual Expert for Pretrained Language Models paper page: huggingface.co/papers/2311.0… introduce CogVLM, a powerful open-source visual language foundation model. Different from the popular shallow alignment method which maps image features into the input space of language model, CogVLM bridges the gap between the frozen pretrained language model and image encoder by a trainable visual expert module in the attention and FFN layers. As a result, CogVLM enables deep fusion of vision language features without sacrificing any performance on NLP tasks. CogVLM-17B achieves state-of-the-art performance on 10 classic cross-modal benchmarks, including NoCaps, Flicker30k captioning, RefCOCO, RefCOCO+, RefCOCOg, Visual7W, GQA, ScienceQA, VizWiz VQA and TDIUC, and ranks the 2nd on VQAv2, OKVQA, TextVQA, COCO captioning, etc., surpassing or matching PaLI-X 55B.

35,547

Tsinghua KEG (THUDM) · Feb 10, 2023 · 6:34 AM UTC

Tsinghua KEG (THUDM)

@thukeg

10 Feb 2023

Replying to @srush_nlp

Lots of fun when you survived a tornado! The team enjoyed it a lot after training the GLM-130B. Technical & engineering details and lessons learned when training 100B-scale models are in the ICLR'23 paper: arxiv.org/pdf/2210.02414.pdf

2,989

Tsinghua KEG (THUDM) · Mar 16, 2023 · 6:44 PM UTC

Tsinghua KEG (THUDM)

@thukeg

16 Mar 2023

The open ChatGLM-6B model (1T tokens pretraining + SFT + RLHF) is on the top spot of today's GitHub trending repos. Thank you! Download & play the model here: github.com/THUDM/ChatGLM-6B

3,571

Tsinghua KEG (THUDM) · Dec 27, 2023 · 12:38 PM UTC

Tsinghua KEG (THUDM)

@thukeg

27 Dec 2023

#CogAgent: an open vision language model #VLM for building GUI agents, supporting 1120*1120 input & topping VQAv2, OK-VQ, TextVQA, ST-VQA, ChartQA, infoVQA, DocVQA, MM-Vet, POPE & surpassing models on GUI operation datasets including AITW and Mind2Web. Paper: arxiv.org/pdf/2312.08914.pdf Model & SFT dataset: github.com/THUDM/CogVLM

9,804

Tsinghua KEG (THUDM) · Jun 14, 2023 · 12:14 PM UTC

Tsinghua KEG (THUDM)

@thukeg

14 Jun 2023

#WebGLM: the new member of the GLM family (GLM-130B, ChatGLM-130B, ChatGLM-6B, VisualGLM-6B). It is a web-enhanced QA system based on the General Language Model (GLM). All we need next is a search engine. github.com/THUDM/WebGLM Paper: arxiv.org/pdf/2306.07906.pdf #KDD2023 @_akhaliq

9,330

Tsinghua KEG (THUDM) · Sep 20, 2022 · 1:37 PM UTC

Tsinghua KEG (THUDM)

@thukeg

20 Sep 2022

#CodeGeeX #InCoder #CodeGen #GPT-J results on HumanEval-X: a new & realistic benchmark for multilingual program synthesis with 820 human-crafted coding programs in #Python #CPP #Java #JS #Go w tests and solutions. HumanEval-X: keg.cs.tsinghua.edu.cn/codeg…

Tsinghua KEG (THUDM) · Dec 10, 2023 · 4:13 AM UTC

Tsinghua KEG (THUDM)

@thukeg

10 Dec 2023

Our #ChatGLM chief @jietang and team @ShawLiu12 @xujz0703 and many are at #NeurIPS2023. Also a proud sponsor of NeurIPS this year. See you there! github.com/THUDM/ChatGLM3

GitHub - zai-org/ChatGLM3: ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型 - zai-org/ChatGLM3

github.com

Sasha Rush

@srush_nlp

27 Nov 2023

I'm moderating a plenary panel at #NeurIPS2023 entitled "LLMs: Beyond Scaling" with some amazing researchers. Please send or upvote any interesting questions: dory.app/events/2KZxWFPULUn9…

8,508

Tsinghua KEG (THUDM) · Jul 25, 2023 · 1:05 PM UTC

Tsinghua KEG (THUDM)

@thukeg

25 Jul 2023

Github: github.com/THUDM/CodeGeeX2 HF : huggingface.co/THUDM/codegee…

GitHub - zai-org/CodeGeeX2: CodeGeeX2: A More Powerful Multilingual Code Generation Model

CodeGeeX2: A More Powerful Multilingual Code Generation Model - zai-org/CodeGeeX2

github.com

Tsinghua KEG (THUDM)

@thukeg

25 Jul 2023

13,629

Tsinghua KEG (THUDM) · Mar 15, 2023 · 1:03 AM UTC

Tsinghua KEG (THUDM)

@thukeg

15 Mar 2023

What a day! ChatGLM-6B & ChatGLM should have picked up another day for introduction on Twitter...🤣

3,754

Tsinghua KEG (THUDM) · Apr 30, 2023 · 4:32 PM UTC

Tsinghua KEG (THUDM)

@thukeg

30 Apr 2023

The ChatGLM and GLM-130B team is in #ICLR2023 #ICLR look forward to meeting everyone soon!!

4,868

Tsinghua KEG (THUDM) · Mar 24, 2023 · 12:44 PM UTC

Tsinghua KEG (THUDM)

@thukeg

24 Mar 2023

The center of the AI universe 🪐 thank you for everything you’ve done for the community, incredible!

@_akhaliq

24 Mar 2023

15k in last 28 days, thank you for all the support 🙏

17,459

Tsinghua KEG (THUDM) · Aug 1, 2023 · 11:20 AM UTC

Tsinghua KEG (THUDM)

@thukeg

1 Aug 2023

“Why is the sky blue?” See how #WebGLM explains it piped.video/watch?v=ohjrlYCL… #WebGLM is a web-enhanced QA system based on the General Language Model (GLM). All we need next is a search engine. huggingface.co/THUDM/WebGLM 🤗github.com/THUDM/WebGLM @kdd_news @_akhaliq #KDD2023

@_akhaliq

14 Jun 2023

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences paper page: huggingface.co/papers/2306.0… present WebGLM, a web-enhanced question-answering system based on the General Language Model (GLM). Its goal is to augment a pre-trained large language model (LLM) with web search and retrieval capabilities while being efficient for real-world deployments. To achieve this, we develop WebGLM with strategies for the LLM-augmented retriever, bootstrapped generator, and human preference-aware scorer. Specifically, we identify and address the limitations of WebGPT (OpenAI), through which WebGLM is enabled with accuracy, efficiency, and cost-effectiveness advantages. In addition, we propose systematic criteria for evaluating web-enhanced QA systems. We conduct multi-dimensional human evaluation and quantitative ablation studies, which suggest the outperformance of the proposed WebGLM designs over existing systems. WebGLM with the 10-billion-parameter GLM (10B) is shown to perform better than the similar-sized WebGPT (13B) and even comparably to WebGPT (175B) in human evaluation.

14,457

Tsinghua KEG (THUDM) · Apr 11, 2023 · 6:11 AM UTC

Tsinghua KEG (THUDM)

@thukeg

11 Apr 2023

The paper is here: arxiv.org/abs/2303.17568

@_akhaliq

30 Sep 2022

2,079

Tsinghua KEG (THUDM) · Mar 15, 2023 · 2:19 AM UTC

Tsinghua KEG (THUDM)

@thukeg

15 Mar 2023

ChatGLM-6B (1T tokens, SFT,RLHF) freely download here github.com/THUDM/ChatGLM-6B The model is bilingual (Chinese & English ) but sft is optimized for Chinese for now. Looking to see how we can learn from #alpaca on English. ChatGLM.cn based on glm-130b is online!

GitHub - zai-org/ChatGLM-6B: ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型 - zai-org/ChatGLM-6B

github.com

toucan

@distributionat

14 Mar 2023

Models announced today: - @OpenAI GPT4 - @AnthropicAI Claude - @thukeg ChatGLM - @GoogleAI Med-PaLM 2 Any more I need to add to foundationmodeltracker.com?

1,951

Tsinghua KEG (THUDM) · Oct 27, 2022 · 3:30 PM UTC

Tsinghua KEG (THUDM)

@thukeg

27 Oct 2022

Replying to @percyliang

GLM-130B attempts to do this w/ completely open code, data, and all issues faced and lessons learned. So it could be a/the possible answer? github.com/THUDM/GLM-130B

GitHub - zai-org/GLM-130B: GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023) - zai-org/GLM-130B

github.com

Tsinghua KEG (THUDM) · Apr 13, 2023 · 1:35 PM UTC

Tsinghua KEG (THUDM)

@thukeg

13 Apr 2023

ImageReward: an open general-purpose text-to-image human preference Reward Model & | a t2i scoring metric 30%+ over CLIP, BLIP, & Aesthetic in terms of human preferences. github.com/THUDM/ImageReward

GitHub - zai-org/ImageReward: [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences...

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation - zai-org/ImageReward

github.com

@_akhaliq

13 Apr 2023

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation abs: arxiv.org/abs/2304.05977 github: github.com/THUDM/ImageReward

2,185

Tsinghua KEG (THUDM) · Jul 5, 2023 · 11:54 AM UTC

Tsinghua KEG (THUDM)

@thukeg

5 Jul 2023

ChatGLM22222222-6B tops the trending 🙏

clem 🤗

@ClementDelangue

5 Jul 2023

Trending models, datasets and spaces of the week, now at hf.co/models, hf.co/datasets & hf.co/spaces with all the filters by tasks, languages, licenses and more! Congrats to @salesforce @OpenChatCo @erhartford @MosaicML @Replit @fkadev @MetaAI @TIIuae and many others!

4,713

Tsinghua KEG (THUDM) · Dec 28, 2023 · 5:30 AM UTC

Tsinghua KEG (THUDM)

@thukeg

28 Dec 2023

#CogAgent Here are the HF links for downloading the models: Chat version🤗: huggingface.co/THUDM/cogagen… VQA version🤗: huggingface.co/THUDM/cogagen…

zai-org/cogagent-chat-hf · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

Tsinghua KEG (THUDM)

@thukeg

27 Dec 2023

2,082

Tsinghua KEG (THUDM) · Feb 21, 2023 · 2:26 AM UTC

Tsinghua KEG (THUDM)

@thukeg

21 Feb 2023

Our #CodeGeeX (13B) code generation & translation model is serving programmers on VS Code, JetBrains, and CloudStudio (Tencent's cloud IDE). And it is FREE! Here is its performance among multilingual code models (evaluated on Sep. 2022):

CodeGeeX @codegeex_ai

21 Feb 2023

CodeGeeX is an AI-powered code generation tool designed to accelerate your coding process. Similar to GitHub Copilot, but CodeGeeX is available for free. medium.com/@innovation64feng… @sama @elonmusk @liuren @zibuyu9 @liujiang @BillGates @csdncto

2,190

Tsinghua KEG (THUDM) · Mar 25, 2023 · 7:12 AM UTC

Tsinghua KEG (THUDM)

@thukeg

25 Mar 2023

A brilliant #ChatGLM application scenario: Chat with Books or Wikipedia. github.com/THUDM/ChatGLM-6B

GitHub - zai-org/ChatGLM-6B: ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型 - zai-org/ChatGLM-6B

github.com

G_Z

@GZhan57

25 Mar 2023

用本地的 #ChatGLM 配合看书! 修改了下之前写的用ChatGPT辅助读书写的一套代码, 试着集成了@thukeg的开源版LLM, 可以基于书的内容的进行长问答+显示出处与章节. 话说他们最近还出了embedding, 还没来得及用. 期待更多中文作为第一公民的LLM了.

2,276

Tsinghua KEG (THUDM) · Mar 14, 2023 · 3:50 PM UTC

Tsinghua KEG (THUDM)

@thukeg

14 Mar 2023

ChatGLM (chatglm.cn), the chat model based on GLM-130B, has been under alpha test w/ invited only-users since Feb. 28. See details here chatglm.cn/blog (in Chinese)

2,226

Tsinghua KEG (THUDM) · May 29, 2023 · 3:09 PM UTC

Tsinghua KEG (THUDM)

@thukeg

29 May 2023

March 14: ChatGLM-6B 😉 github.com/THUDM/ChatGLM-6B/… huggingface.co/THUDM/chatglm… May 18: VisualGLM-6B 😄 huggingface.co/THUDM/visualg… github.com/THUDM/VisualGLM-6…

ChatGLM-6B/README_en.md at main · zai-org/ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型 - zai-org/ChatGLM-6B

github.com

Omar Sanseviero

@osanseviero

29 May 2023

May is not over, but so many exciting things this month! (2 days to surprise us!) 🔥QLoRA: 4-bit finetuning 🌸StarCoder and StarChat, SOTA Open Source Code models 🔊5x faster Whisper with less than 8GB GPU Check them out and add any missing! ⤵️ github.com/osanseviero/ml_ti…

2,435

Tsinghua KEG (THUDM) · Jul 11, 2023 · 3:06 AM UTC

Tsinghua KEG (THUDM)

@thukeg

11 Jul 2023

New protein & antibody #LLMs in the GLM family: #xTrimoPGLM-100B: protein understanding & generation. SOTA in 13 of 15 tasks. #xTrimoPGLM-Ab-1B: for antibodies, outperforming AF2 in structure prediction, 4000x inference speedup. Paper: biorxiv.org/content/10.1101/… @_akhaliq

4,807

Tsinghua KEG (THUDM) · Mar 23, 2023 · 11:52 AM UTC

Tsinghua KEG (THUDM)

@thukeg

23 Mar 2023

Steaming output. ChatGLM-6B!

Yuvi @yvrjsharma

23 Mar 2023

🚨🤖 Exciting news! Check out our new @Gradio Chatbot demo for ChatGLM-6B to create a bilingual chatbot with Streaming capabilities! @thukeg [Video]:🤩With Gradio's new 🎨Theme feature, you can customize the chatbot to your liking just as I've tried adding THUDM colors🟪to the mix! We have a Themes hackathon underway too 🏆 JOIN- huggingface.co/Gradio-Themes 💬 ChatGLM-6B answers align with human preference. The model outputs are streamed directly to the chatbot component for seamless interaction. So, get ready to chat away with AI on Gradio! 🙌🚀 Demo on @huggingface - huggingface.co/spaces/ysharm…

2,852

Tsinghua KEG (THUDM) · Oct 23, 2023 · 11:32 AM UTC

Tsinghua KEG (THUDM)

@thukeg

23 Oct 2023

#AgentTuning: Enabling Generalized Agent Abilities For LLMs, e.g., let #Llama2-7/13/70B to achieve 3.5-level agent abilities while stay good on MMLU, GSM8K, HumanEval. All models were behind GPT-4/3.5, Claude on #AgentBench. 🤗huggingface.co/THUDM/agentlm… github.com/THUDM/AgentTuning

4,113

Tsinghua KEG (THUDM) · Oct 11, 2022 · 2:05 PM UTC

Tsinghua KEG (THUDM)

@thukeg

11 Oct 2022

Thanks @Tim_Dettmers!! Super work on book and opt etc. It’d be great to chat on further pushing the boundary to make 100B-scale models as “small” as possible!

Tim Dettmers

@Tim_Dettmers

10 Oct 2022

GLM-130B is excellent concurrent work with our LLM.int8(). It is wonderful to see how much you can learn by studying both side-by-side — each enhanced by others' insights. I can confirm many of the findings, e.g. scaling laws, but also have immediately new ideas to improve it!

Tsinghua KEG (THUDM) · Aug 17, 2022 · 5:56 AM UTC

Tsinghua KEG (THUDM)

@thukeg

17 Aug 2022

Thanks @_akhaliq & his team for helping set up the @huggingface space for KEG: huggingface.co/thudm GLM-130B, CogView2, CogVideo are included!!

@_akhaliq

5 Aug 2022

A @Gradio Demo for GLM-130B: An Open Bilingual Pre-Trained Model on @huggingface Spaces by huggingface.co/hanyullai demo: huggingface.co/spaces/hanyul… Get started with Gradio: gradio.app/getting_started/

Tsinghua KEG (THUDM) · Oct 16, 2023 · 6:02 PM UTC

Tsinghua KEG (THUDM)

@thukeg

16 Oct 2023

Another examples on Programming, world_knowledge, Referring Expression Comprehension (REC), and vqa

Tsinghua KEG (THUDM)

@thukeg

16 Oct 2023

1,840

Tsinghua KEG (THUDM) · Aug 4, 2022 · 6:15 PM UTC

Tsinghua KEG (THUDM)

@thukeg

4 Aug 2022

Our intermediate results on LAMBADA

Tsinghua KEG (THUDM) · Aug 8, 2023 · 6:52 AM UTC

Tsinghua KEG (THUDM)

@thukeg

8 Aug 2023

#AgentBench 🤯Static NLP datasets are not enough for evaluating existing #LLMs 🌟We should test them in practical interactive environments for agents! Find more videos for LLM-as-Agent in #AgentBench at llmbench.ai

Xiao Liu (Shaw)

@ShawLiu12

8 Aug 2023

Thanks @arankomatsuzaki for sharing our paper #AgentBench ! 🤯Static NLP datasets are not enough for evaluating existing LLMs 🌟We should test them in practical interactive environments for agents! Find more videos for LLM-as-Agent in AgentBench at llmbench.ai !

1,978

Tsinghua KEG (THUDM) · Sep 6, 2022 · 1:37 AM UTC

Tsinghua KEG (THUDM)

@thukeg

6 Sep 2022

Introducing kgTransformer at #KDD2022. It is designed to pre-train transformers on massive knowledge graphs with Mixture-of-Experts (MoE). It can answer complex logical queries in a unified manner. @kdd_news Kudos to @ShawLiu12, Shiyu, Kai & all! code: github.com/THUDM/kgTransform…

Tsinghua KEG (THUDM) · Sep 21, 2022 · 1:12 AM UTC

Tsinghua KEG (THUDM)

@thukeg

21 Sep 2022

Thanks a million @LoubnaBenAllal1! You guys from @huggingface have been always so nice to help with this (immediately)!! Last time, @_akhaliq @osanseviero helped a lot on GLM-130B and our team space huggingface.co/thudm. Thank you!

Loubna Ben Allal

@LoubnaBenAllal1

20 Sep 2022

Replying to @thukeg

Great work! We added HumanEval-X to the Hugging Face hub and we can transfer it to your HF organization huggingface.co/datasets/loub…. It would be great to have the models there too!

Tsinghua KEG (THUDM) · Sep 21, 2022 · 1:07 AM UTC

Tsinghua KEG (THUDM)

@thukeg

21 Sep 2022

We are making #CodeGeeX freely available on VS Code (and with extra features such as code translation btw programming languages).

Bill Tribble @bill_tribble

20 Sep 2022

Replying to @thukeg

So like this kinda thing might be a Copilot competitor? @b_antunes

Tsinghua KEG (THUDM) · Sep 15, 2022 · 2:48 AM UTC

Tsinghua KEG (THUDM)

@thukeg

15 Sep 2022

Excited to introduce XDAI #KDD2022, a toolkit for exploiting pretrained #LLM in knowledge-grounded dialogue generation without any training or fine-tuning. #GLM130B @kdd_news Try XDAI ChatBot (Chinese Only for now): models.aminer.cn/xdai/ Developers: github.com/THUDM/XDAI

Tsinghua KEG (THUDM) · Mar 28, 2023 · 4:22 AM UTC

Tsinghua KEG (THUDM)

@thukeg

28 Mar 2023

😅 thanks for the feedback! Lots of work to do :)

Matt Sheehan @mattsheehan88

27 Mar 2023

The best open-source chatbot from 🇨🇳 is now available to play with: ChatGLM-6B from Tsinghua's @thukeg. Try it out: huggingface.co/spaces/ysharm… It's impressive on many fronts, and displays many of the same weaknesses as other LLMs. Also does some interesting English-Chinese mixing👇

2,819

Tsinghua KEG (THUDM) · Aug 17, 2022 · 5:46 AM UTC

Tsinghua KEG (THUDM)

@thukeg

17 Aug 2022

Jiezhong Qiu, a KEG alum at Tsinghua CS, wins the KDD'22 Outstanding Dissertation Award RunnerUp for his work on Graph Rep Learning. First time this award goes to a group in Asia since established in 2008! Congrats Jiezhong & his advisor @jietang @Tsinghua_Uni @kdd_news #KDD2022

Tsinghua KEG (THUDM) · Aug 3, 2023 · 2:53 PM UTC

Tsinghua KEG (THUDM)

@thukeg

3 Aug 2023

Wow what a great lineup of speakers: CoT, GPT-4, Llama2, ChatGLM2, PaLM2!

SIGKDD 2026 @kdd_news

3 Aug 2023

The hottest topic of the premier conference! Please attend the LLM day at KDD-2023 next week! August 8, Los Angeles. bigmodel.ai/llmday-kdd23/

998

Tsinghua KEG (THUDM) · Sep 21, 2022 · 1:43 AM UTC

Tsinghua KEG (THUDM)

@thukeg

21 Sep 2022

YES!! The current plan is to release #CodeGeeX model weights next week! Stay tuned ;)

Alex Leiva @aviel08

21 Sep 2022

Replying to @thukeg

Language translation and open source!?

ALT Pikachu Shocked Face Stunned GIF

Tsinghua KEG (THUDM) · Oct 22, 2023 · 9:27 AM UTC

Tsinghua KEG (THUDM)

@thukeg

22 Oct 2023

Btw, our CogVLM arxiv submission (ArXiv ID 5148899) has been "on hold" for about two weeks without clear reasons. Is arXiv supposed to be a timely "publishing" model? Please help if possible @arxiv @_akhaliq 😖

Tsinghua KEG (THUDM)

@thukeg

16 Oct 2023

1,552

Tsinghua KEG (THUDM) · Apr 27, 2023 · 1:09 PM UTC

Tsinghua KEG (THUDM)

@thukeg

27 Apr 2023

#CodeGeeX open model (and paper) at github.com/THUDM/CodeGeeX

Rowan Cheung

@rowancheung

27 Apr 2023

Replying to @rowancheung @svpino

3. Replit reveals replit-code-v1-3b Replit introduced their very own open-sourced LLaMa style LLM, 'replit-code-v1-3b.' It's trained on 2.7 billion parameters and performs 40% better than comparable models.

1,393

Tsinghua KEG (THUDM) · Apr 9, 2023 · 9:19 AM UTC

Tsinghua KEG (THUDM)

@thukeg

9 Apr 2023

Thanks for cooking the video! Proud to push the open sourced advances in LLMs!

Harrison Kinsley

@Sentdex

8 Apr 2023

Exploring the concept of a GLM (General Language Model) and working with ChatGLM6B. ChatGLM6B is a 6.2B parm LLM, similar to ChatGPT, that can run on small as 6GB of memory. Video: piped.video/watch?v=fGpXj4bl…

3,294

Tsinghua KEG (THUDM) · Sep 21, 2022 · 7:07 AM UTC

Tsinghua KEG (THUDM)

@thukeg

21 Sep 2022

Thanks for trying! Plz feel free to share suggestions/comments!

Thibault Castells

@thibcastells

21 Sep 2022

Open source, and seems to do the job! I just wrote the comment, then this is what I got:

Tsinghua KEG (THUDM) · Jan 8, 2023 · 10:36 AM UTC

Tsinghua KEG (THUDM)

@thukeg

8 Jan 2023

Cool, thanks for sharing!! GLM-130B was an attempt to open-source a 100B-scale model at least as good as GPT-3 (davinci) and unveil how models of such a scale can be successfully pre-trained! Stay tuned for more advanced models. Btn GPT3 and ChatGPT&3.5, there was a long way...

This tweet is unavailable

528

Tsinghua KEG (THUDM) · Aug 4, 2022 · 8:27 AM UTC

Tsinghua KEG (THUDM)

@thukeg

4 Aug 2022

The KEG team's paper on AMiner.cn wins the SIGKDD Test of Time Award!

SIGKDD 2026 @kdd_news

14 Aug 2020

Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang and Zhong Su received the Test of Time Award for Applied Science in recognition of their study of mining academic social networks published in the 2008 paper, "ArnetMiner: Extraction And Mining Of Academic Social Networks."

Tsinghua KEG (THUDM) · Jun 14, 2023 · 12:18 PM UTC

Tsinghua KEG (THUDM)

@thukeg

14 Jun 2023

Thanks! #WebGLM accepted at #KDD2023

Aran Komatsuzaki

@arankomatsuzaki

14 Jun 2023

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences 10B WebGLM performs better than 13B WebGPT and even comparably to 175B WebGPT in human evaluation. The code, demo, and data are released. repo: github.com/THUDM/WebGLM abs: arxiv.org/abs/2306.07906

1,598

Tsinghua KEG (THUDM) · Oct 16, 2023 · 6:00 PM UTC

Tsinghua KEG (THUDM)

@thukeg

16 Oct 2023

Here are more examples on OCR_free_reasoning, OCR_free_vqa, grounding_vqa, grounding_with_caption.

Tsinghua KEG (THUDM)

@thukeg

16 Oct 2023

912

Tsinghua KEG (THUDM) · Apr 4, 2023 · 3:45 AM UTC

Tsinghua KEG (THUDM)

@thukeg

4 Apr 2023

And @thukeg 😄

clem 🤗

@ClementDelangue

3 Apr 2023

Trending models, datasets and spaces of the week on hf.co. Congrats to @cerebras @databricks @BananaDev_ (April Fool) @AnthropicAI @nomic_ai @Picsart @AlibabaGroup (excited about text to video!!) and all others!

2,019

Tsinghua KEG (THUDM) · Jan 8, 2023 · 10:41 AM UTC

Tsinghua KEG (THUDM)

@thukeg

8 Jan 2023

Yes, big models from Google are great!!!

Yi Tay

@YiTayML

8 Jan 2023

MMLU in the 40s..... Flan-T5 Large (700M params) beats this lol. 😂

516

Tsinghua KEG (THUDM) · Jan 8, 2023 · 1:31 PM UTC

Tsinghua KEG (THUDM)

@thukeg

8 Jan 2023

There is a lot of work happening from davinci (GPT-3) to code-davinci-002 to text-davinci-002 (InstructGPT) to text-davinci-003 and beyond. The (currently) open-sourced GLM-130B is only on its first step. The mountain has many paths...but no easy one.

Sengxian @Sengxian

8 Jan 2023

Replying to @ItakGol @AndyChenML

Maybe this question is too hard for a LLM without RLHF. Neither davinci nor text-davinci-002 can do it, only text-davinci-003 and chatgpt with RLHF can answer correctly

825

Tsinghua KEG (THUDM) · Jan 5, 2024 · 3:11 AM UTC

Tsinghua KEG (THUDM)

@thukeg

5 Jan 2024

Big congrats! @ShawLiu12 from our team and Huan'ang!

Tsinghua CS @thudcst

5 Jan 2024

We are thrilled to announce that Gao Huan'ang and Liu Xiao from #Tsinghua #TsinghuaDCST have won the 2023 Tsinghua University Top Student Award! This is the 5th year in a row that our department has excelled in both undergraduate and graduate levels. Congrats! 🎉🎉🎉

910

Tsinghua KEG (THUDM) · Jan 8, 2023 · 10:51 AM UTC

Tsinghua KEG (THUDM)

@thukeg

8 Jan 2023

Well said. In addition, GLM-130B was an attempt to open-source a 100B-scale model at least as good as GPT-3 (**davinci**) and unveil how models of such a scale can be successfully pre-trained! Btn GPT3(2020.05) and ChatGPT (2022.11)&3.5, there was a long way...Stay tuned🤗

This tweet is unavailable

768

Tsinghua KEG (THUDM) · Nov 24, 2023 · 5:19 AM UTC

Tsinghua KEG (THUDM)

@thukeg

24 Nov 2023

github.com/THUDM/CogVLM

GitHub - zai-org/CogVLM: a state-of-the-art-level open visual language model | 多模态预训练模型

a state-of-the-art-level open visual language model | 多模态预训练模型 - zai-org/CogVLM

github.com

SkalskiP

@skalskip92

23 Nov 2023

looking for OpenAI-4V alternatives? - LLaVA - BakLLaVA - CogVLM - Qwen-VL different tasks: - VQA - answering questions about images - OCR - reading text - zero-shot detections link: blog.roboflow.com/gpt-4-visi…

888

Tsinghua KEG (THUDM) · Mar 18, 2023 · 5:38 AM UTC

Tsinghua KEG (THUDM)

@thukeg

18 Mar 2023

Replying to @SamChen8008 @osanseviero

FYI, this model was trained from scratch (1T tokens + SFT&RLHF). It is the small version of ChatGLM (GLM-130B + SFT&RLHF) github.com/THUDM/GLM-130B

120

Tsinghua KEG (THUDM) · Nov 11, 2023 · 7:38 AM UTC

Tsinghua KEG (THUDM)

@thukeg

11 Nov 2023

wow, proud that we @thukeg (thudm) from @thudcst @Tsinghua_Uni are among the very top. Also thanks @huggingface 🤗 github.com/THUDM

THUKEG

ChatGLM, GLM-4, CogVLM, CodeGeeX, CogView, ImageReward, CogVideoX | CogDL, GraphMAE, AMiner | Zhipu.ai (Z.ai) & Knowledge Engineering Group (KEG) - THUKEG

github.com

Omar Sanseviero

@osanseviero

11 Nov 2023

I was curious about which universities are using Hugging Face. The answer: over 5000 groups 🤯 Explore all universities at huggingface.co/spaces/osanse… Some groups with the most likes: @thukeg, @HelsinkiNLP, @humphrey_shi Labs, @UKPLab, @uwnlp, and @stanfordnlp 🔥

1,613

Tsinghua KEG (THUDM) · Jun 14, 2023 · 12:16 PM UTC

Tsinghua KEG (THUDM)

@thukeg

14 Jun 2023

Thanks AK! GLM -> GLM-130B -> ChatGLM-6B/130B -> VisualGLM-6B -> #WebGLM -> ?

@_akhaliq

14 Jun 2023

925

Tsinghua KEG (THUDM) · Aug 17, 2022 · 5:49 AM UTC

Tsinghua KEG (THUDM)

@thukeg

17 Aug 2022

Congrats Yuxiao! @ericdongyx #KDD2022 @Tsinghua_Uni @kdd_news

Nitesh Chawla @nvchawla

15 Aug 2022

Proud advisor moment: my PhD advisee, Yuxiao Dong, received the SIGKDD Rising Star Award @kdd_news. It is so well deserving for Yuxiao. #proudadvisor #datascience #KDD2022 @notredame @lucy_institute

Tsinghua KEG (THUDM) · Nov 24, 2023 · 2:55 AM UTC

Tsinghua KEG (THUDM)

@thukeg

24 Nov 2023

💜💙🤗🤗🧡♥️ Thank you for all the LIKEs!

Omar Sanseviero

@osanseviero

22 Nov 2023

The top 15 most-liked organizations on @huggingface 1. @StabilityAI 20k likes 2. @AIatMeta 20k 3. @runwayml 11k 4. CompVis 10k 5. @thukeg 7k 6. @BigscienceW 7k 7. @TIIuae 7k 8. @Microsoft 6.5k 9. @GoogleAI 6k 10. @OpenAI 4k 11. @BigCodeProject 4k 12. @MosaicML 4k 13. @UKPLab 3k 14. @AiEleuther 3k 15. @salesforce 3k huggingface.co/spaces/Pulsar…

1,054

Tsinghua KEG (THUDM) · Mar 29, 2023 · 2:28 PM UTC

Tsinghua KEG (THUDM)

@thukeg

29 Mar 2023

🔥

Julien Chaumond

@julien_c

29 Mar 2023

oh noooo

1,135

Tsinghua KEG (THUDM) · Aug 5, 2022 · 7:37 AM UTC

Tsinghua KEG (THUDM)

@thukeg

5 Aug 2022

Replying to @osanseviero

Thank you! That will be great! Messaging you right now for details.

Tsinghua KEG (THUDM) · Nov 24, 2023 · 5:20 AM UTC

Tsinghua KEG (THUDM)

@thukeg

24 Nov 2023

#CogVLM

SkalskiP

@skalskip92

23 Nov 2023

I'm super impressed with Qwen-VL and CogVLM! I've done a few (probably very naive) tests to compare LLaVA, BakLLaVA, Qwen-VL, CogVLM, and GPT-4V. Tests include VQA, OCR, and zero-shot detection. Any ideas on what else I should test?

984

Tsinghua KEG (THUDM) · Apr 30, 2023 · 3:10 AM UTC

Tsinghua KEG (THUDM)

@thukeg

30 Apr 2023

happy 112-th birthday!

Tsinghua University

@Tsinghua_Uni

24 Apr 2023

Get excited, everyone! Tsinghua's birthday is just around the corner! Join our #Tsinghua112 anniversary celebration on the last Sunday of April by sending your #HappyBirthdayTsinghua wish or sharing a photo of you on campus to recall #MyTsinghuaStory memories.

1,963

Tsinghua KEG (THUDM) · Aug 6, 2022 · 6:07 AM UTC

Tsinghua KEG (THUDM)

@thukeg

6 Aug 2022

Replying to @marekkraft

Lol. As mentioned in the blog, we are also working on quantization to make it work for 3090s (and possibly one or two A100 cards)

Tsinghua KEG (THUDM) · Nov 23, 2023 · 11:36 AM UTC

Tsinghua KEG (THUDM)

@thukeg

23 Nov 2023

Big congrats to Prof Shimin! Very well deserved!! We’ve learned so much from him!!

Tsinghua CS @thudcst

23 Nov 2023

Professor Hu Shimin from #Tsinghua DCST was elected as Academician of Chinese Academy of Sciences, for his great contributions to Computer Graphics, Geometric Computing and Artificial Intelligence! He also developed the widely-used DL framework Jittor. Congrats to Prof Hu! 🎉🎉🎉

957

Tsinghua KEG (THUDM) · Mar 14, 2023 · 3:55 PM UTC

Tsinghua KEG (THUDM)

@thukeg

14 Mar 2023

Love to hear your thoughts on ChatGLM-6B and ChatGLM.cn @osanseviero @ChenhaoTan @_akhaliq @percyliang #LLM

Tsinghua KEG (THUDM)

@thukeg

14 Mar 2023

768

Tsinghua KEG (THUDM) · Aug 4, 2022 · 6:15 PM UTC

Tsinghua KEG (THUDM)

@thukeg

4 Aug 2022

Our intermediate results on MMLU

Tsinghua KEG (THUDM) · May 29, 2023 · 2:38 PM UTC

Tsinghua KEG (THUDM)

@thukeg

29 May 2023

Happy to see this: "RL fine-tuning with reward function from human feedback (ImageReward: arxiv.org/abs/2304.05977) can reduce bias in the pre-trained model." #ImageReward: github.com/THUDM/ImageReward

Kimin

@kimin_le2

25 May 2023

❓ What is an effective approach for fine-tuning pre-trained t2i diffusion models using a reward function? 💡 I'm excited to share "DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models" co-led by @yingfan_bot Website: sites.google.com/view/dpok-t… 🧵 1/N

897

Tsinghua KEG (THUDM) · Oct 28, 2022 · 1:14 AM UTC

Tsinghua KEG (THUDM)

@thukeg

28 Oct 2022

Replying to @percyliang

Good point! We have scripts to perform all evaluation tasks. Let us also make the script for model training open on GitHub as well. We'll keep everyone updated on this! Thanks for the suggestion!!

Tsinghua KEG (THUDM) · Apr 4, 2023 · 6:09 AM UTC

Tsinghua KEG (THUDM)

@thukeg

4 Apr 2023

@_akhaliq

Aran Komatsuzaki

@arankomatsuzaki

4 Apr 2023

I'm new to AI and ChatGPT. Who should I follow to learn more about them?

256

Tsinghua KEG (THUDM) · Mar 28, 2023 · 12:10 PM UTC

Tsinghua KEG (THUDM)

@thukeg

28 Mar 2023

Replying to @beaver_hopping @astoks @mattsheehan88

FYI, pre-trained on both Chinese and English, and SFT mostly on Chinese.

476

Tsinghua KEG (THUDM) · Nov 9, 2023 · 8:04 AM UTC

Tsinghua KEG (THUDM)

@thukeg

9 Nov 2023

Big congrats to everyone! It was led by Mr. Pengcheng Wang (the left), a master student from our KEG lab! It is about using LLMs to power digital human interactions! Our digital humans was also used in the Beijing 2022 Winter Olympics & Paralympics games w/ sign language!

Tsinghua CS @thudcst

7 Nov 2023

🎉🎉Hats off to the talented student teams from Tsinghua for bagging top honors at 18th "Challenge Cup". "Interactive Live Digital Human" creates interactive live digital entities with personality, powered by Large Language Models. The project gained 30K+ followers on Bilibili👏

1,085

Tsinghua KEG (THUDM) · Sep 22, 2022 · 8:13 AM UTC

Tsinghua KEG (THUDM)

@thukeg

22 Sep 2022

Replying to @ncooper57

Aha, good question! This is something that we'll discuss about it internally.

Tsinghua KEG (THUDM) · Apr 25, 2023 · 4:06 AM UTC

Tsinghua KEG (THUDM)

@thukeg

25 Apr 2023

We are flattered but that is a tough position. In any case, OpenAI is #1 likely with a distant second not to mention others

Matt Sheehan @mattsheehan88

24 Apr 2023

Gotta disagree w/ @BradSmi on Beijing Academy of AI as one of 3 orgs at "absolute forefront" of generative AI. BAAI's WuDao model was announced to much fanfare, but never released or externally tested. Either Baidu or Tsinghua's @thukeg and it's GLM models likely leading China.

1,463

Tsinghua KEG (THUDM) · Oct 28, 2022 · 2:46 PM UTC

Tsinghua KEG (THUDM)

@thukeg

28 Oct 2022

Replying to @thukeg @percyliang

Here is the link for training GLM-130B. plz feel free to let us know of any questions or issues. github.com/THUDM/LargeScale/…

Tsinghua KEG (THUDM) · Sep 22, 2022 · 8:12 AM UTC

Tsinghua KEG (THUDM)

@thukeg

22 Sep 2022

Replying to @qumeric

Thx! Had results on MBPP. Will need to work on APPS.

Tsinghua KEG (THUDM) · Aug 5, 2022 · 6:10 AM UTC

Tsinghua KEG (THUDM)

@thukeg

5 Aug 2022

Replying to @lazilyoptimal

Good point! We'd love to have it run on TPU as well but unfortunately we can't get access to TPU for testing for now ...

Tsinghua KEG (THUDM) · Oct 27, 2022 · 3:29 PM UTC

Tsinghua KEG (THUDM)

@thukeg

27 Oct 2022

GLM-130B attempts to do this w/ completely open code, data, and all issues faced and lessons learned. So it could be a/the possible answer? github.com/THUDM/GLM-130B

Percy Liang

@percyliang

27 Oct 2022

What is the largest fully reproducible language model? That is, where I can get the data and code and run a sequence of commands that deterministically produces the exact model?

Tsinghua KEG (THUDM) · Nov 28, 2023 · 4:35 PM UTC

Tsinghua KEG (THUDM)

@thukeg

28 Nov 2023

@jietang

Sasha Rush

@srush_nlp

27 Nov 2023

I'm moderating a plenary panel at #NeurIPS2023 entitled "LLMs: Beyond Scaling" with some amazing researchers. Please send or upvote any interesting questions: dory.app/events/2KZxWFPULUn9…

227

Tsinghua KEG (THUDM) · Apr 30, 2023 · 5:25 PM UTC

Tsinghua KEG (THUDM)

@thukeg

30 Apr 2023

the #CogVideo team is also there!

Tsinghua KEG (THUDM)

@thukeg

30 Apr 2023

The ChatGLM and GLM-130B team is in #ICLR2023 #ICLR look forward to meeting everyone soon!!

904

Tsinghua KEG (THUDM) · Aug 5, 2022 · 7:56 AM UTC

Tsinghua KEG (THUDM)

@thukeg

5 Aug 2022

Replying to @dustinvtran

Lol 😉 Here is the ACL'22 paper on "why" GLM aclanthology.org/2022.acl-lo… GLM: General Language Model Pretraining with Autoregressive Blank Infilling

GLM: General Language Model Pretraining with Autoregressive Blank Infilling

Zhengxiao Du, Yujie Qian, Xiao Liu, Ming Ding, Jiezhong Qiu, Zhilin Yang, Jie Tang. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)....

aclanthology.org