You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@HolyWu Thanks for reporting this. This issue occurs because the bfloat16 weights in the model are converted into FP32 during the conversion phase and hence inputs and weights now have different data types. One approach that seemed to work is casting the weights to BF16 within the converter
To Reproduce
Environment
conda
,pip
,libtorch
, source): pipThe text was updated successfully, but these errors were encountered: