Skip to content

Actions: pytorch/torchft

Unit Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
374 workflow runs
374 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

checkpointing/HTTPTransport: added streaming serialization and parall…
Unit Tests #324: Commit f44aaa5 pushed by d4l3k
February 11, 2025 21:23 9m 22s main
February 11, 2025 21:23 9m 22s
checkpointing/HTTPTransport: added streaming serialization and parallel transfer support
Unit Tests #323: Pull request #106 synchronize by d4l3k
February 11, 2025 19:48 9m 10s d4l3k/fast_http
February 11, 2025 19:48 9m 10s
Clean up localsgd backup params
Unit Tests #322: Pull request #107 opened by H-Huang
February 11, 2025 18:43 6m 48s H-Huang:diloco
February 11, 2025 18:43 6m 48s
checkpointing/HTTPTransport: added streaming serialization and parallel transfer support
Unit Tests #321: Pull request #106 synchronize by d4l3k
February 11, 2025 02:27 9m 44s d4l3k/fast_http
February 11, 2025 02:27 9m 44s
Adds reduce_scatter into torchft (#102)
Unit Tests #319: Commit e55542a pushed by allenwang28
February 10, 2025 23:25 9m 3s main
February 10, 2025 23:25 9m 3s
checkpointing: move to subfolder (#105)
Unit Tests #318: Commit 8f0d125 pushed by d4l3k
February 10, 2025 22:47 9m 1s main
February 10, 2025 22:47 9m 1s
checkpointing: move to subfolder
Unit Tests #317: Pull request #105 opened by d4l3k
February 10, 2025 22:37 9m 13s d4l3k/checkpoint_restructure
February 10, 2025 22:37 9m 13s
Adds reduce_scatter into torchft
Unit Tests #316: Pull request #102 synchronize by allenwang28
February 10, 2025 21:55 9m 2s allenwang28:collectives
February 10, 2025 21:55 9m 2s
Adds reduce_scatter into torchft
Unit Tests #315: Pull request #102 synchronize by allenwang28
February 10, 2025 21:51 8m 58s allenwang28:collectives
February 10, 2025 21:51 8m 58s
Adds reduce_scatter into torchft
Unit Tests #314: Pull request #102 synchronize by allenwang28
February 10, 2025 21:19 8m 59s allenwang28:collectives
February 10, 2025 21:19 8m 59s
make torchft work for llama3_8b 8x
Unit Tests #313: Pull request #104 synchronize by d4l3k
February 9, 2025 00:39 7m 13s d4l3k/fast_checkpoint
February 9, 2025 00:39 7m 13s
make torchft work for llama3_8b 8x
Unit Tests #312: Pull request #104 synchronize by d4l3k
February 8, 2025 05:10 7m 7s d4l3k/fast_checkpoint
February 8, 2025 05:10 7m 7s
make torchft work for llama3_8b 8x
Unit Tests #311: Pull request #104 synchronize by d4l3k
February 8, 2025 02:41 6m 53s d4l3k/fast_checkpoint
February 8, 2025 02:41 6m 53s
make torchft work for llama3_8b 8x
Unit Tests #310: Pull request #104 synchronize by d4l3k
February 8, 2025 02:00 7m 12s d4l3k/fast_checkpoint
February 8, 2025 02:00 7m 12s
make torchft work for llama3_8b 8x
Unit Tests #309: Pull request #104 synchronize by d4l3k
February 8, 2025 01:58 6m 45s d4l3k/fast_checkpoint
February 8, 2025 01:58 6m 45s
make torchft work for llama3_8b 8x
Unit Tests #308: Pull request #104 synchronize by d4l3k
February 8, 2025 00:55 6m 43s d4l3k/fast_checkpoint
February 8, 2025 00:55 6m 43s
make torchft work for llama3_8b 8x
Unit Tests #307: Pull request #104 opened by d4l3k
February 8, 2025 00:35 7m 3s d4l3k/fast_checkpoint
February 8, 2025 00:35 7m 3s
Refactors process_group_tests.py
Unit Tests #306: Pull request #103 synchronize by allenwang28
February 7, 2025 22:04 8m 57s allenwang28:pg_test_refactor
February 7, 2025 22:04 8m 57s
Refactors process_group_tests.py
Unit Tests #305: Pull request #103 synchronize by allenwang28
February 7, 2025 22:02 8m 55s allenwang28:pg_test_refactor
February 7, 2025 22:02 8m 55s
Refactors process_group_tests.py
Unit Tests #304: Pull request #103 synchronize by allenwang28
February 7, 2025 21:58 9m 4s allenwang28:pg_test_refactor
February 7, 2025 21:58 9m 4s
Refactors process_group_tests.py
Unit Tests #303: Pull request #103 synchronize by allenwang28
February 7, 2025 20:18 9m 22s allenwang28:pg_test_refactor
February 7, 2025 20:18 9m 22s
Refactors process_group_tests.py
Unit Tests #302: Pull request #103 synchronize by allenwang28
February 7, 2025 20:13 9m 11s allenwang28:pg_test_refactor
February 7, 2025 20:13 9m 11s
Refactors process_group_tests.py
Unit Tests #301: Pull request #103 opened by allenwang28
February 7, 2025 20:11 6m 58s allenwang28:pg_test_refactor
February 7, 2025 20:11 6m 58s
MonitoredQueue: fail fast when subprocess exits (#99)
Unit Tests #300: Commit 9533676 pushed by d4l3k
February 7, 2025 18:05 9m 11s main
February 7, 2025 18:05 9m 11s