Changes
- llama.cpp updated to b4562
- Added support MiniCPM-1B, Minicpm-omni, Deepseek-R1-Qwen distill, PhiMoE, DeepSeek V3 models
- Added supprot Minerva 7B, Deepseek MoE v1 & GigaChat models, Qwen2VL, Falcon3 models
- Added support InfiniAI Megrez 3b, OLMo, QRWKV6, Llama-3_1-Nemotron models
- Metal improvements
- Fixed some errors