Pull requests: InternLM/InternEvo

Pull requests list

fix test loss
#384 opened Dec 3, 2024 by li126com
6 tasks
fix(mlp): enhance mlp_layer_fusion
#382 opened Dec 3, 2024 by yingtongxiong
1 task done
feat(isp): support switch for launch ag and forward overlap per module (label: enhancement)
#381 opened Dec 3, 2024 by huangting4201
5 of 6 tasks
feat(isp): add early_reduce_scatter_release support
#380 opened Dec 2, 2024 by mwiacx
fix(pp): fix pp get tensor shape err and layernorm input dtype err (label: bug)
#378 opened Dec 2, 2024 by huangting4201
5 of 6 tasks
fix(gmm): change communicator.grad_hook to async
#371 opened Nov 20, 2024 by blankde
6 tasks
feat(fp8): [Work In Progress] enable FP8 training
#369 opened Nov 6, 2024 by zigzagcai
6 tasks
fix llava model device bugs
#359 opened Oct 28, 2024 by hellozmz
6 tasks
Feat/refactor process group
#358 opened Oct 28, 2024 by mwiacx
28 tasks done
feat(moe): add gshard token rearrange optim
#352 opened Oct 21, 2024 by blankde
6 tasks
feat(moe): support moe zero1 setting
#350 opened Oct 16, 2024 by blankde
6 tasks
feat(zero bubble): update zbh1
#343 opened Sep 24, 2024 by li126com Draft
6 tasks
feat(moe): add moe async param handler
#332 opened Sep 13, 2024 by blankde
6 tasks
update test loss
#329 opened Sep 13, 2024 by li126com Draft
6 tasks
Feat/add zeropp
#256 opened Jun 20, 2024 by chrysantd
6 tasks
add gradient sharding
#87 opened Mar 18, 2024 by ChenQiaoling00
6 tasks