Pull requests: huggingface/nanotron
[feature] Add debug_dataloader_samples utility to preview decoded dataloader samples (#184)
#368
opened May 26, 2025 by
garongkim
6 tasks
[Feature] Hide 75% of the communication in tensor parallelism using DoMiNo
#292
opened Mar 10, 2025 by
xrsrke
Fix unpacking issue caused by newer Flash Attention
#289
opened Mar 5, 2025 by
Stillerman
3 of 6 tasks
[Feature] Over 99% communication overlap in Tensor Parallelism using Domino
#286
opened Mar 1, 2025 by
hwchen2017
Add context manager to increase NCCL timeout temporarily
#280
opened Feb 18, 2025 by
thomas-bouvier
[CICD] Add a timeout for unit tests and measure their execution time
#270
opened Jan 17, 2025 by
xrsrke