Skip to content

Actions: AI-Hypercomputer/maxtext

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,422 workflow runs
1,422 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add Pallas GPU decode attention in Maxtext inference
Tests #1378: Pull request #1066 synchronize by tohaowu
February 25, 2025 23:11 34m 50s tohaowu-pallas-gpu
February 25, 2025 23:11 34m 50s
Merge pull request #1221 from kaixih:moe_fp8
Tests #1377: Commit 433726d pushed by copybara-service bot
February 25, 2025 23:11 25m 12s main
February 25, 2025 23:11 25m 12s
Add Pallas GPU decode attention in Maxtext inference
Tests #1376: Pull request #1066 synchronize by tohaowu
February 25, 2025 23:06 24m 25s tohaowu-pallas-gpu
February 25, 2025 23:06 24m 25s
[NVIDIA] Support FP8 quantization for MOE layers
Tests #1375: Pull request #1221 synchronize by kaixih
February 25, 2025 22:34 21m 34s kaixih:moe_fp8
February 25, 2025 22:34 21m 34s
Add Pallas GPU decode attention in Maxtext inference
Tests #1374: Pull request #1066 synchronize by tohaowu
February 25, 2025 22:06 27m 45s tohaowu-pallas-gpu
February 25, 2025 22:06 27m 45s
Poc elastic training
Tests #1373: Pull request #1310 opened by lukebaumann
February 25, 2025 21:48 26m 6s poc-elastic-training
February 25, 2025 21:48 26m 6s
Add Pallas GPU decode attention in Maxtext inference
Tests #1372: Pull request #1066 synchronize by tohaowu
February 25, 2025 21:45 37m 9s tohaowu-pallas-gpu
February 25, 2025 21:45 37m 9s
Add Pallas GPU decode attention in Maxtext inference
Tests #1370: Pull request #1066 synchronize by tohaowu
February 25, 2025 21:43 21m 51s tohaowu-pallas-gpu
February 25, 2025 21:43 21m 51s
Add Pallas GPU decode attention in Maxtext inference
Tests #1369: Pull request #1066 synchronize by tohaowu
February 25, 2025 21:22 28m 53s tohaowu-pallas-gpu
February 25, 2025 21:22 28m 53s
Add Pallas GPU decode attention in Maxtext inference
Tests #1368: Pull request #1066 synchronize by tohaowu
February 25, 2025 21:22 22m 35s tohaowu-pallas-gpu
February 25, 2025 21:22 22m 35s
Tests
Tests #1367: Scheduled
February 25, 2025 20:04 32m 15s main
February 25, 2025 20:04 32m 15s
Add Pallas GPU decode attention in Maxtext inference
Tests #1365: Pull request #1066 synchronize by tohaowu
February 25, 2025 19:47 28m 31s tohaowu-pallas-gpu
February 25, 2025 19:47 28m 31s
Add Pallas GPU decode attention in Maxtext inference
Tests #1364: Pull request #1066 synchronize by tohaowu
February 25, 2025 19:46 23m 16s tohaowu-pallas-gpu
February 25, 2025 19:46 23m 16s
Add Pallas GPU decode attention in Maxtext inference
Tests #1363: Pull request #1066 synchronize by tohaowu
February 25, 2025 18:22 27m 46s tohaowu-pallas-gpu
February 25, 2025 18:22 27m 46s
integrate gpu pallas flash attention . Reduce prefill time for llama70b
Tests #1362: Pull request #1305 synchronize by tohaowu
February 25, 2025 18:09 35m 13s gpu_pallas_flash
February 25, 2025 18:09 35m 13s
Add Pallas GPU decode attention in Maxtext inference
Tests #1361: Pull request #1066 synchronize by tohaowu
February 25, 2025 17:47 21m 21s tohaowu-pallas-gpu
February 25, 2025 17:47 21m 21s
Merge pull request #1308 from jharmsen:patch-1
Tests #1360: Commit 026ae6c pushed by copybara-service bot
February 25, 2025 17:33 22m 25s main
February 25, 2025 17:33 22m 25s
[gitignore] Add '.idea' dir in .gitignore
Tests #1359: Pull request #1309 opened by wyzhang
February 25, 2025 17:07 24m 9s wyzhang/misc
February 25, 2025 17:07 24m 9s
Add Pallas GPU decode attention in Maxtext inference
Tests #1358: Pull request #1066 synchronize by tohaowu
February 25, 2025 16:31 20m 54s tohaowu-pallas-gpu
February 25, 2025 16:31 20m 54s
Add Pallas GPU decode attention in Maxtext inference
Tests #1357: Pull request #1066 synchronize by tohaowu
February 25, 2025 16:22 22m 46s tohaowu-pallas-gpu
February 25, 2025 16:22 22m 46s
Tests
Tests #1356: Scheduled
February 25, 2025 16:04 24m 5s main
February 25, 2025 16:04 24m 5s
Tests
Tests #1355: Scheduled
February 25, 2025 12:06 22m 8s main
February 25, 2025 12:06 22m 8s
Prefix Caching with HBM and latency test
Tests #1354: Pull request #1278 synchronize by yuyanpeng-google
February 25, 2025 09:01 23m 55s yuyan-prefix-cache-dev
February 25, 2025 09:01 23m 55s