failed to translate executables for sharkai models #19874

Open
ziereis opened this issue Feb 1, 2025 · 0 comments
Labels
bug 🐞 Something isn't working

Comments

@ziereis
Contributor

ziereis commented Feb 1, 2025

What happened?

Can't compile the 3B OpenLLaMA model.

cuda:

failed to translate executables
failed to translate executables
shark_models/shark_llama_3b_bs1.mlir:1121:12: error: failed to run translation of source executable to target executable for backend #hal.executable.target<"cuda", "cuda-nvptx-fb", {iree.gpu.target = #iree_gpu.target<arch = "sm_80", features = "+ptx76", wgp = <compute =  fp64|fp32|fp16|int64|int32|int16|int8, storage =  b64|b32|b16|b8, subgroup =  shuffle|arithmetic, dot =  dp4xi8toi32, mma = [<NV_WMMA_F32_16x16x16_F16>, <NV_WMMA_F16_16x16x16_F16>], subgroup_size_choices = [32], max_workgroup_sizes = [1024, 1024, 1024], max_thread_count_per_workgroup = 1024, max_workgroup_memory_bytes = 166912, max_workgroup_counts = [2147483647, 65535, 65535]>>}>
    %378 = torch.aten.transpose.int %377#0, %int1_170, %int2_171 : !torch.vtensor<[1,32,?,100],f16>, !torch.int, !torch.int -> !torch.vtensor<[1,?,32,100],f16>

For HIP, the compilation seems to be stuck somewhere: it didn't finish after roughly 20 minutes.

Steps to reproduce your issue

Model exported with:

python -m sharktank.examples.export_paged_llm_v1 --hf-dataset=SlyEcho/open_llama_3b_v2_f16_gguf --bs=1

Compilation that fails (CUDA):

iree-compile --iree-hal-target-device=cuda -iree-cuda-target=sm_80 

Compilation that either takes very long or is stuck somewhere (HIP); I let it run for 20 minutes:

iree-compile --iree-hal-target-device=hip --iree-hip-target=gfx90a
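To help tell a hang apart from a slow compile, the HIP command can be run under a hard time limit. The following is only a minimal sketch: `run_with_timeout` is a hypothetical helper, not part of IREE, and the input path in the usage comment is assumed from the CUDA error message above (the commands in this report do not show the input file).

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_s):
    """Run cmd and return (status, output).

    status is "ok" (exit code 0), "failed" (non-zero exit code),
    or "timeout" (the process was killed after timeout_s seconds).
    """
    try:
        proc = subprocess.run(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,  # merge stderr so diagnostics are kept
            text=True,
            timeout=timeout_s,
        )
    except subprocess.TimeoutExpired as exc:
        out = exc.output or ""
        # The partial output may arrive as bytes even when text=True.
        if isinstance(out, bytes):
            out = out.decode(errors="replace")
        return "timeout", out
    return ("ok" if proc.returncode == 0 else "failed"), proc.stdout


# Usage (assumed input path, 20 minute limit as in this report):
#   status, output = run_with_timeout(
#       ["iree-compile", "--iree-hal-target-device=hip",
#        "--iree-hip-target=gfx90a",
#        "shark_models/shark_llama_3b_bs1.mlir"],
#       timeout_s=20 * 60,
#   )
#   print(status)
```

A "timeout" status only shows that the limit was reached; it does not by itself distinguish a true hang from a compile that needs more time, so raising `timeout_s` and comparing runs may still be necessary.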

What component(s) does this issue relate to?

Compiler

Version information

4693b1c

Additional context

No response

@ziereis ziereis added the bug 🐞 Something isn't working label Feb 1, 2025