-
Notifications
You must be signed in to change notification settings - Fork 565
Issues: pytorch/torchtune
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
"Missing LoRA key from adapter state dict" when attempting to resume LoRA training from adapter checkpoint
#2537
opened Mar 30, 2025 by
prvnsmpth
How to separately load a local llama8B and a self-trained LoRA adapter during distillation?
#2520
opened Mar 21, 2025 by
whale2133
Use foreach compilable scale/grad / clip_grad
best practice
Things we should be doing but aren't
enhancement
New feature or request
#2517
opened Mar 19, 2025 by
IvanKobzarev
Add torch.compile to optimizer.step()
best practice
Things we should be doing but aren't
enhancement
New feature or request
#2516
opened Mar 19, 2025 by
IvanKobzarev
Unnecessarily scaling gradients when gradient_accumulation_steps is 1
best practice
Things we should be doing but aren't
enhancement
New feature or request
#2515
opened Mar 19, 2025 by
shunting314
Can we decouple the data preprocssing/tokenization step from the fine-tuning phase?
#2497
opened Mar 13, 2025 by
Electronic-Waste
Consolidate tok_encode logic between _LLMEvalWrapper and _VLMEvalWrapper
#2488
opened Mar 12, 2025 by
pbontrager
recursive_reshard
best practice
Things we should be doing but aren't
bug
Something isn't working
#2483
opened Mar 12, 2025 by
caiqi
Add add_end_token to the Qwen Models
bug
Something isn't working
community help wanted
We would love the community's help completing this issue
good first issue
Good for newcomers
#2481
opened Mar 11, 2025 by
pbontrager
Add add_end_token to Phi tokenizers
bug
Something isn't working
community help wanted
We would love the community's help completing this issue
good first issue
Good for newcomers
#2480
opened Mar 11, 2025 by
pbontrager
Add add_end_token to Mistral tokenizer
bug
Something isn't working
community help wanted
We would love the community's help completing this issue
good first issue
Good for newcomers
#2479
opened Mar 11, 2025 by
pbontrager
Add add_end_token to the Gemma Tokenizer
bug
Something isn't working
community help wanted
We would love the community's help completing this issue
good first issue
Good for newcomers
#2478
opened Mar 11, 2025 by
pbontrager
MPS memory leak
bug
Something isn't working
triage review
This issue should be discussed in weekly review
#2473
opened Mar 10, 2025 by
SalmanMohammadi
Will support multi-turn conversations?
enhancement
New feature or request
triage review
This issue should be discussed in weekly review
#2463
opened Mar 6, 2025 by
dz1iang
No chat template in evaluation
community help wanted
We would love the community's help completing this issue
enhancement
New feature or request
#2459
opened Mar 5, 2025 by
xueyan-lii
Bring in DSV3 from torchtitan
enhancement
New feature or request
#2457
opened Mar 4, 2025 by
EugenHotaj
[Question]: activation offload won't work for torch version < 2.5
discussion
Start a discussion
enhancement
New feature or request
#2456
opened Mar 4, 2025 by
Irvingwangjr
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.