Pull requests: huggingface/trl

Pull requests list

Make sure chat template isn't lost when truncating prompt.
#3651 opened Jun 25, 2025 by pramodith
2 of 5 tasks
Faster position_ids computation for FFD packing
#3649 opened Jun 25, 2025 by mariosasko
1 of 5 tasks
[WIP] vllm-server-spec-dec-support
#3643 opened Jun 24, 2025 by shirinyamani
5 tasks
GRPO: Pack Responses within the same group.
#3642 opened Jun 24, 2025 by pramodith Draft
4 of 5 tasks
Add type hints to dpo_trainer.py
#3631 opened Jun 23, 2025 by bvantuan
Feature: Add SGLang support for GRPO Trainer
#3627 opened Jun 21, 2025 by PrinsYin Draft
5 tasks
[WIP] [SFT] SFT doc rewrite
#3619 opened Jun 18, 2025 by qgallouedec
5 tasks
ClearML logging of visualization in RewardTrainer evaluation
#3602 opened Jun 16, 2025 by ioverho
2 of 5 tasks
Fix: corrected fsdp in GRPO trainer
#3582 opened Jun 13, 2025 by tryumanshow
2 of 5 tasks
Check rewards shapes in RewardTrainer
#3577 opened Jun 13, 2025 by ioverho
4 tasks done
Chisquare regularized DPO
#3573 opened Jun 12, 2025 by asparius
[WIP] 🥳 new rloo
#3533 opened Jun 3, 2025 by shirinyamani
5 tasks
Push KTAE impl
#3518 opened May 30, 2025 by SamComber
5 tasks
intuit
#3513 opened May 29, 2025 by shirinyamani
5 tasks
🎀 New defaults: gradient_checkpointing=True
#3510 opened May 29, 2025 by qgallouedec
5 tasks
Add Bidirectional Knowledge Distillation Option to GKDTrainer
#3508 opened May 29, 2025 by shaischaudhry
3 of 5 tasks
HF Doc Builder style
#3498 opened May 26, 2025 by qgallouedec Draft
[GRPO] Adds SSR priorized replay buffer
#3496 opened May 26, 2025 by edbeeching
[GKD] Use vllm for the student model
#3475 opened May 21, 2025 by kashif
5 tasks
Add support for CB with native transformers
#3471 opened May 20, 2025 by ArthurZucker
Allow an user to train from a local dataset
#3470 opened May 19, 2025 by gogo2464
1 of 5 tasks