
align xpu behavior w/ cuda #2551


Open · wants to merge 11 commits into main

Conversation

@yao-matrix (Contributor) commented May 21, 2025

  1. For lorafa and randlora: PEFT already requires torch >= 1.13, and torch 1.13 provides a device-agnostic torch.autocast, so switch to the device-agnostic API to also cover XPU (see the sketch after this list).
  2. Clean up the tests folder to use the device-agnostic clear-cache API. Before this PR, some test cases used the device-agnostic clear-cache API and some used torch.cuda.xx; after this PR, all use the device-agnostic API.
  3. Enable the gptqmodel multi-device test case on XPU, and enable the torchao test cases on XPU.
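
For illustration, a minimal sketch of the two patterns described above; the helper name clear_device_cache is hypothetical, not the exact helper used in PEFT's test utilities:

```python
import torch

# Pick a device type dynamically instead of hard-coding "cuda".
device_type = "xpu" if hasattr(torch, "xpu") and torch.xpu.is_available() else "cuda"

# 1. Device-agnostic autocast (available since torch 1.13) instead of torch.cuda.amp.autocast:
with torch.autocast(device_type=device_type, dtype=torch.bfloat16):
    pass  # forward/backward pass goes here

# 2. Device-agnostic cache clearing instead of calling torch.cuda.empty_cache() directly:
def clear_device_cache():
    if torch.cuda.is_available():
        torch.cuda.empty_cache()
    elif hasattr(torch, "xpu") and torch.xpu.is_available():
        torch.xpu.empty_cache()
```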

Signed-off-by: YAO Matrix <[email protected]>
@yao-matrix (Contributor, Author) commented:

@githubnemo, please help review. Thanks very much.

@yao-matrix (Contributor, Author) commented:

@IlyasMoutawwakil, could you please help review? Thanks.

@yao-matrix (Contributor, Author) commented:

@IlyasMoutawwakil, do you know who can help review and merge PRs for the peft repo? Thanks very much.

yao-matrix changed the title from "align xpu behavior w/ CUDA in lorafa" to "align xpu behavior w/ cuda" on May 28, 2025
@@ -78,7 +78,7 @@ def require_torch_multi_gpu(test_case):
     return test_case


-def require_multi_accelerator(test_case):
+def require_torch_multi_accelerator(test_case):
@yao-matrix (Contributor, Author) commented on the diff:
This decorator is actually about torch multi-accelerator support, since it uses torch_device; the rename reflects that and aligns the naming convention with require_torch_multi_gpu.
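
A hedged sketch of what such a decorator could look like (the exact body in PEFT's testing utilities may differ; the device-count logic below is an assumption):

```python
import unittest

import torch

def require_torch_multi_accelerator(test_case):
    """Skip the test unless more than one torch accelerator (CUDA or XPU) is present."""
    if torch.cuda.is_available():
        device_count = torch.cuda.device_count()
    elif hasattr(torch, "xpu") and torch.xpu.is_available():
        device_count = torch.xpu.device_count()
    else:
        device_count = 0
    return unittest.skipUnless(device_count > 1, "test requires multiple accelerators")(test_case)
```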

@IlyasMoutawwakil (Member) left a comment:

LGTM, just one nit.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@githubnemo (Collaborator) left a comment:

I agree with @IlyasMoutawwakil, it would be best to use a helper function to determine if bfloat16 is available.

LGTM otherwise :)
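
For reference, a helper along the lines the reviewers suggest might look like the sketch below (the function name is illustrative, and it assumes that XPU devices reachable through a recent PyTorch build support bfloat16):

```python
import torch

def is_bf16_available() -> bool:
    """Best-effort check for bfloat16 support on the active accelerator."""
    if torch.cuda.is_available():
        return torch.cuda.is_bf16_supported()
    if hasattr(torch, "xpu") and torch.xpu.is_available():
        # Assumption: XPU devices supported by recent PyTorch releases handle bfloat16.
        return True
    return False
```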

Signed-off-by: Matrix YAO <[email protected]>
Signed-off-by: Matrix YAO <[email protected]>
@yao-matrix (Contributor, Author) commented May 29, 2025

@IlyasMoutawwakil @githubnemo I've updated the PR per your comments; please review. I've also checked the CI failure, and it seems unrelated to my changes. Thanks very much.
