Fixed the model conversion bug caused by minicpm's GQA structure。After testing minicpm's GQA, the converted model generates all <h>. This is because the number of k and v matrices of Gqa should be the same as kv_head, not the same as head/kv_head. #8249

LDLINGLINGLING · 2024-07-02T07:45:01Z

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

ngxson · 2024-07-02T11:04:13Z

Does this only affect minicpm 2, or both version 1 and 2?

LDLINGLINGLING · 2024-07-02T11:59:11Z

only minicpm 2,Because only minicpm2 started using gqa

ngxson · 2024-07-02T13:23:10Z

Changing this will break minicpm 1, we should add a check somewhere to detect if the input model is minicpm 1 or 2

LDLINGLINGLING · 2024-07-03T01:36:25Z

Is the 1 you mentioned cpm-bee? This model has never been supported. Which model did you crash? Can I check it out?

compilade · 2024-07-04T14:45:06Z

Seems to be the same as #7967, therefore my comment from there applies here too.

This makes MiniCPMModel._reverse_hf_permute exactly equivalent to LlamaModel.permute. Should LlamaModel.permute (which is also a static method) be used instead in MiniCPMModel?

ngxson · 2024-07-04T15:03:53Z

@LDLINGLINGLING I saw that support for minicpm is added a long time ago, so I supposed it was version 1.

But to be honest, there have been quite a lot of versions of minicpm so I'm not even sure what we're talking about. Could you specify which version you've tested?

A perplexity test would be appreciated.

LDLINGLINGLING · 2024-07-05T04:15:29Z

Hello, I just found that the code I uploaded before is wrong. The new commit has tested all minicpm, and the results are normal. The code in the original llama.cpp cannot effectively convert the minicpm of the 1b parameter. , it is worth noting that in your figure, the model with the letter v is multi-modal, which is not supported by llama.cpp.

修改了由于minicpm的GQA结构带来的模型转换bug

47d821a

github-actions bot added the python python script changes label Jul 2, 2024

LDLINGLINGLING changed the title ~~修改了由于minicpm的GQA结构带来的模型转换bug~~ Fixed the model conversion bug caused by minicpm's GQA structure Jul 2, 2024

mofosyne added the Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix label Jul 3, 2024

fix bug of minicpm1b,minicpm2b

a3efa29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fixed the model conversion bug caused by minicpm's GQA structure。After testing minicpm's GQA, the converted model generates all <h>. This is because the number of k and v matrices of Gqa should be the same as kv_head, not the same as head/kv_head. #8249

Fixed the model conversion bug caused by minicpm's GQA structure。After testing minicpm's GQA, the converted model generates all <h>. This is because the number of k and v matrices of Gqa should be the same as kv_head, not the same as head/kv_head. #8249

Uh oh!

LDLINGLINGLING commented Jul 2, 2024

Uh oh!

ngxson commented Jul 2, 2024

Uh oh!

LDLINGLINGLING commented Jul 2, 2024

Uh oh!

ngxson commented Jul 2, 2024

Uh oh!

LDLINGLINGLING commented Jul 3, 2024

Uh oh!

compilade commented Jul 4, 2024

Uh oh!

ngxson commented Jul 4, 2024 •

edited

Loading

Uh oh!

LDLINGLINGLING commented Jul 5, 2024

Uh oh!

Uh oh!

Fixed the model conversion bug caused by minicpm's GQA structure。After testing minicpm's GQA, the converted model generates all <h>. This is because the number of k and v matrices of Gqa should be the same as kv_head, not the same as head/kv_head. #8249

Are you sure you want to change the base?

Fixed the model conversion bug caused by minicpm's GQA structure。After testing minicpm's GQA, the converted model generates all <h>. This is because the number of k and v matrices of Gqa should be the same as kv_head, not the same as head/kv_head. #8249

Uh oh!

Conversation

LDLINGLINGLING commented Jul 2, 2024

Uh oh!

ngxson commented Jul 2, 2024

Uh oh!

LDLINGLINGLING commented Jul 2, 2024

Uh oh!

ngxson commented Jul 2, 2024

Uh oh!

LDLINGLINGLING commented Jul 3, 2024

Uh oh!

compilade commented Jul 4, 2024

Uh oh!

ngxson commented Jul 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LDLINGLINGLING commented Jul 5, 2024

Uh oh!

Uh oh!

ngxson commented Jul 4, 2024 •

edited

Loading