-
Notifications
You must be signed in to change notification settings - Fork 310
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Possible issue with AWQ library when using AWQ models with Ctranslate2 #1821
Comments
Following up... I tried converting an AWQ model here: https://huggingface.co/Qwen/Qwen2.5-7B-Instruct-AWQ The command I used and the error I received were as follows...I'm guessing because of some kind of torch conflict?
|
I do not yet support AWQ for Qwen2. A new pull request is coming soon. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Just FYI, I think that the
autoawq
library only supports up to a certain version of thetorch
library pursuant to this message, which I received after (1) installingautoawq
and (2) subsequently installingtorch
version 2.4.0.The reason I installed Torch 2.4.0 is that it supports CUDA 12.4, which, as I understand it, is the version that officially supports CUDNN 9+, which, in turn,
ctranslate2
4.5.0 requires.Correct me if I'm mistaken...
It's a little confusing because the autoawq repository has this portion in their readme.md:
Furthermore, when I pip install
autoawq
it installsautoawq-kernels
by default. This somewhat conflicts with the readme.md, which states thattriton
is used by default...The repository for the "kernels" is here: https://github.com/casper-hansen/AutoAWQ_kernels
It states that you can build this specific library based on torch version and cuda compute level so...Perhaps it's just a matter of what wheels are are available by default (and whether they're compatible with
ctranslate2
) versus building the library, which goes above and beyond the available wheels...Anyhow, just FYI, because if the AWQ library requires a specific version of
torch
and it doesn't overlap with ctranslate2's requirements, it could create a serious conflict.The text was updated successfully, but these errors were encountered: