Skip to content

Issues: modelscope/ms-swift

GRPO (R1) 训练交流群
#3076 opened Feb 12, 2025 by Jintao-Huang
Open 3
Megatron-SWIFT训练交流群
#3604 opened Mar 21, 2025 by Jintao-Huang
Open
ms-swift3 Suggestion Box
#2217 opened Oct 10, 2024 by Jintao-Huang
Open 39
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Deepspeed Zero++ 会出现Nan
#3697 opened Mar 27, 2025 by MuyeHuang
multi-node grpo training hangs
#3695 opened Mar 27, 2025 by phoenixbai
GRPO微调,gpu利用率很低
#3693 opened Mar 27, 2025 by Stephen-K1
Cache Inference Optimization
#3689 opened Mar 27, 2025 by Eduiskss
deepspeed错误
#3681 opened Mar 26, 2025 by Sakura-not-sleep
微调以后效果不好怎么办?
#3678 opened Mar 26, 2025 by Z-oo883
批次问题
#3674 opened Mar 26, 2025 by Sakura-not-sleep
xgrammar support
#3672 opened Mar 26, 2025 by jacksonjack001
KTO 训练每次保持ckt 都报错
#3669 opened Mar 26, 2025 by dhhcj
【bug】Failed to open local file in cache
#3667 opened Mar 25, 2025 by jdy18
ProTip! Type g i on any issue or pull request to go back to the issue listing page.