Skip to content

Get ready for the open-sourcing of GLM-4-0414. #733

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 30 commits into from
Apr 14, 2025

Conversation

zRzRzRzRzRzRzR
Copy link
Member

With the open-sourcing of the GLM-4-0414 model, a large portion of this repository’s code will be updated. This PR presents a streamlined version after the refactoring.

  • Re-measured model runtime performance using H100.
  • Updated dependency requirements and removed a significant amount of unused and outdated code.
  • The new version GLM-4-0414 will be fully supported by native Hugging Face transformers. The previously supported THUDM/glm-4-9b-chat (non-HF) version will no longer be maintained or supported.
  • Fine-tuning code for the 32B model is not supported.
  • Code is now standardized using pre-commit.

The demo for the Z1 model has not been implemented yet. Please do not merge this PR.

Copy link

@zhangch9 zhangch9 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great Work!

@zRzRzRzRzRzRzR zRzRzRzRzRzRzR merged commit 9c0e421 into THUDM:main Apr 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants