ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

AI-generated keywords: Large Language Models ChatGLM GLM-4 Open-sourcing Model capabilities

AI-generated Key Points

  • Introducing the ChatGLM family of large language models:
  • Ranging from GLM-130B to GLM-4 (All Tools)
  • Significant advancements made by the team in understanding and developing these models over the past year and a half
  • Latest ChatGLM models showcase remarkable progress:
  • GLM-4 (0116, 0520), GLM-4-Air (0605), and GLM-4 All Tools
  • Autonomously utilizing external tools and functions for handling complex tasks effectively
  • Performance on par with or surpassing state-of-the-art models like GPT-4 Turbo, Claude 3 Opus, Gemini 1.5 Pro
  • Particularly excelling in tasks related to the Chinese language
  • Commitment to promoting accessibility and safety in Large Language Models (LLMs):
  • Open release of model weights and techniques developed throughout the journey
  • Over 10 million downloads on platforms like Hugging Face in 2023 alone
  • Future focus on democratizing cutting-edge LLM technologies through open sourcing:
  • Developing even more capable models based on gained knowledge
  • Pushing boundaries of model capabilities towards teaching machines to think like humans
  • Gratitude extended to contributors including data annotators, infrastructure operating staff members, collaborators, partners at Zhipu AI, Tsinghua University, Yuxuan Zhang, Wei Jia from Zhipu AI, teams at Hugging Face, ModelScope, WiseModel for their support in open-sourcing efforts
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Team GLM, :, Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Dan Zhang, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang, Jingyu Sun, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong, Mingdao Liu, Minlie Huang, Peng Zhang, Qinkai Zheng, Rui Lu, Shuaiqi Duan, Shudan Zhang, Shulin Cao, Shuxun Yang, Weng Lam Tam, Wenyi Zhao, Xiao Liu, Xiao Xia, Xiaohan Zhang, Xiaotao Gu, Xin Lv, Xinghan Liu, Xinyi Liu, Xinyue Yang, Xixuan Song, Xunkai Zhang, Yifan An, Yifan Xu, Yilin Niu, Yuantao Yang, Yueyan Li, Yushi Bai, Yuxiao Dong, Zehan Qi, Zhaoyu Wang, Zhen Yang, Zhengxiao Du, Zhenyu Hou, Zihan Wang

License: CC BY 4.0

Abstract: We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained on ten trillions of tokens mostly in Chinese and English, along with a small set of corpus from 24 languages, and aligned primarily for Chinese and English usage. The high-quality alignment is achieved via a multi-stage post-training process, which involves supervised fine-tuning and learning from human feedback. Evaluations show that GLM-4 1) closely rivals or outperforms GPT-4 in terms of general metrics such as MMLU, GSM8K, MATH, BBH, GPQA, and HumanEval, 2) gets close to GPT-4-Turbo in instruction following as measured by IFEval, 3) matches GPT-4 Turbo (128K) and Claude 3 for long context tasks, and 4) outperforms GPT-4 in Chinese alignments as measured by AlignBench. The GLM-4 All Tools model is further aligned to understand user intent and autonomously decide when and which tool(s) touse -- including web browser, Python interpreter, text-to-image model, and user-defined functions -- to effectively complete complex tasks. In practical applications, it matches and even surpasses GPT-4 All Tools in tasks like accessing online information via web browsing and solving math problems using Python interpreter. Over the course, we have open-sourced a series of models, including ChatGLM-6B (three generations), GLM-4-9B (128K, 1M), GLM-4V-9B, WebGLM, and CodeGeeX, attracting over 10 million downloads on Hugging face in the year 2023 alone. The open models can be accessed through https://github.com/THUDM and https://huggingface.co/THUDM.

Submitted to arXiv on 18 Jun. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2406.12793v2

Introducing the ChatGLM family of large language models: from GLM-130B to GLM-4 (All Tools). Our team has made significant advancements in understanding and developing these models over the past year and a half. With each generation, we have implemented more effective strategies for model pre-training and alignment. The latest ChatGLM models - including GLM-4 (0116, 0520), GLM-4-Air (0605), and GLM-4 All Tools - showcase remarkable progress in handling complex tasks by autonomously utilizing external tools and functions. These GLM-4 models have demonstrated performance on par with or even surpassing state-of-the-art models like GPT-4 Turbo, Claude 3 Opus, and Gemini 1.5 Pro - particularly excelling in tasks related to the Chinese language. Our commitment to promoting accessibility and safety in Large Language Models (LLMs) is evident through the open release of model weights and techniques developed throughout our journey. The open-sourcing efforts of our models have been well-received, with over 10 million downloads on platforms like Hugging Face in 2023 alone. We are currently working on developing even more capable models based on the knowledge gained so far. Moving forward, our focus remains on democratizing cutting-edge LLM technologies through open sourcing while pushing the boundaries of model capabilities towards the goal of teaching machines to think like humans. We extend our gratitude to all data annotators, infrastructure operating staff members, collaborators, partners at Zhipu AI and Tsinghua University who have contributed to the success of ChatGLM. Special thanks to Yuxuan Zhang and Wei Jia from Zhipu AI as well as teams at Hugging Face, ModelScope, WiseModel for their support in open-sourcing efforts.
Created on 20 Oct. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.