
Pin 2024 03 27 #15

Merged: 31 commits merged into master on Mar 27, 2024
Conversation

mars1248

Pin 2024 03 27
TORCHXLA_PIN_COMMIT=27ae6dc955734b223e2b93e1c5bbcd681b0537fd

mbzomowski and others added 30 commits January 8, 2024 10:21
…rch#6268)

This commit re-enables test/test_zero1.py for GPU. It is passing with the latest torch/xla (commit e1c94df).

pytorch#6260
This is useful for scoping out the single graph and seeing how the compiler is able to optimise it.
This is optional, i.e., not added by default. Once we start setting up another benchmark suite with profiling info etc., this should be run additionally, but separately, so the CUPTI interface won't interact with pure CUDA events.

We do not measure compilation time for this option yet.
With the pure wall-time infrastructure came the ability to do microbenchmarks easily. I added a sample microbenchmark for matmul.
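A pure wall-time matmul microbenchmark could look roughly like the sketch below. This is an illustrative assumption, not the actual harness from this PR: the function names (`matmul`, `bench_wall_time`), the matrix sizes, and the iteration count are all hypothetical.

```python
import time

def matmul(a, b):
    # Naive pure-Python matmul standing in for the benchmarked op.
    n, k, m = len(a), len(b), len(b[0])
    out = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            out[i][j] = sum(a[i][p] * b[p][j] for p in range(k))
    return out

def bench_wall_time(fn, *args, iters=10):
    # Measure pure wall time across several iterations; report the mean.
    start = time.perf_counter()
    for _ in range(iters):
        fn(*args)
    return (time.perf_counter() - start) / iters

a = [[1.0] * 32 for _ in range(32)]
b = [[1.0] * 32 for _ in range(32)]
print(f"matmul mean wall time: {bench_wall_time(matmul, a, b):.6f}s")
```

In the real suite the op would run through torch/xla rather than pure Python, but the timing pattern (warm state, repeated iterations, mean wall time) is the same idea.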
This reverts commit 0c81365.

It causes OOMs for some training workloads on GPUs, and slows down others.
This PR introduces and integrates a verification module under the --verify flag. The verification module takes the model output as input, along with the model/benchmark reconstruction args, then runs the model in eager mode, calculates the relative error against that reference, and compares it to the provided threshold to return the appropriate error code.
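The threshold check described above can be sketched as follows. This is a minimal illustration of relative-error verification, not the PR's implementation: the names `relative_error` and `verify`, the error codes, and the default threshold are assumptions.

```python
# Hypothetical error codes; the actual module's return codes may differ.
VERIFICATION_PASS, VERIFICATION_FAIL = 0, 1

def relative_error(output, reference, eps=1e-12):
    # Max element-wise relative error between two flat float sequences.
    # eps guards against division by zero for zero-valued references.
    return max(abs(o - r) / (abs(r) + eps) for o, r in zip(output, reference))

def verify(output, reference, threshold=1e-3):
    # Compare the relative error against the threshold and return an error code.
    if relative_error(output, reference) <= threshold:
        return VERIFICATION_PASS
    return VERIFICATION_FAIL

# 0.0005 relative error is within the 1e-3 threshold, so this passes.
print(verify([1.0005, 2.0], [1.0, 2.0]))
```

In the PR the reference comes from re-running the reconstructed benchmark eagerly; here a precomputed list stands in for that output.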
@zjjott zjjott merged commit 7b8d407 into master Mar 27, 2024
0 of 2 checks passed