Use Unsloth FastVisionModel for VLM #7295

PLoic · 2025-03-13T21:18:50Z

What does this PR do?

This PR propose to use unsloth FastVisionModel instead of FastLanguageModel for multimodal model.

hiyouga · 2025-03-13T21:45:37Z

src/llamafactory/model/model_utils/unsloth.py

@@ -48,11 +53,14 @@ def load_unsloth_pretrained_model(
    config: "PretrainedConfig", model_args: "ModelArguments"
 ) -> Optional["PreTrainedModel"]:
    r"""Optionally load pretrained model with unsloth. Used in training."""
-    from unsloth import FastLanguageModel  # type: ignore
+    if is_multimodal(model_args.model_name_or_path):


How about use FastModel? https://github.com/unslothai/unsloth/blob/main/unsloth/models/loader.py#L722-L726

I think for LLM, the recommended class to use is FastLanguageModel, which inherits from FastLlamaModel
(https://github.com/unslothai/unsloth/blob/main/unsloth/models/loader.py#L69C25-L69C40, https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-Alpaca.ipynb)

hiyouga · 2025-03-14T07:55:59Z

src/llamafactory/model/model_utils/unsloth.py



 def load_unsloth_peft_model(
    config: "PretrainedConfig", model_args: "ModelArguments", is_trainable: bool
 ) -> "PreTrainedModel":
    r"""Load peft model with unsloth. Used in both training and inference."""
-    from unsloth import FastLanguageModel  # type: ignore
+    if is_multimodal(model_args.model_name_or_path):


how about the situation of providing a local path to the model?

Good point, thanks ! I will check for that ASAP

PLoic added 6 commits March 13, 2025 21:19

Update unsloth.py

b11ebd4

Update adapter.py

3c1eb3c

Update unsloth.py

55239d9

Update adapter.py

9bd7e3a

Update unsloth.py

97dafdc

Update unsloth.py

a02c851

hiyouga reviewed Mar 13, 2025

View reviewed changes

hiyouga reviewed Mar 14, 2025

View reviewed changes

PLoic added 6 commits March 14, 2025 20:04

Update constants.py

bddd600

Update unsloth.py

c111ea8

Update unsloth.py

9a04541

Update common.py

895cf13

Update unsloth.py

68d80a6

Update unsloth.py

0b5690b

PLoic marked this pull request as draft March 14, 2025 21:27

PLoic added 2 commits March 15, 2025 21:01

Update constants.py

4c7ea09

Merge branch 'hiyouga:main' into feat/use_unsloth_fastvision

2be4b9e

PLoic marked this pull request as ready for review March 15, 2025 20:03

PLoic requested a review from hiyouga March 17, 2025 08:22

PLoic added 3 commits March 17, 2025 14:11

Merge branch 'hiyouga:main' into feat/use_unsloth_fastvision

6c33601

Update unsloth.py

89500f3

Update unsloth.py

38554dd

PLoic marked this pull request as draft March 18, 2025 10:43

PLoic added 6 commits March 18, 2025 11:49

Update constants.py

737a08d

Update constants.py

3b00676

Update unsloth.py

67d4943

Update unsloth.py

b04f6ff

Update constants.py

bcdd274

Update unsloth.py

81c7822

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use Unsloth FastVisionModel for VLM #7295

Use Unsloth FastVisionModel for VLM #7295

PLoic commented Mar 13, 2025

hiyouga Mar 13, 2025

PLoic Mar 14, 2025

hiyouga Mar 14, 2025

hiyouga Mar 14, 2025

PLoic Mar 14, 2025

Use Unsloth FastVisionModel for VLM #7295

Are you sure you want to change the base?

Use Unsloth FastVisionModel for VLM #7295

Conversation

PLoic commented Mar 13, 2025

What does this PR do?

hiyouga Mar 13, 2025

Choose a reason for hiding this comment

PLoic Mar 14, 2025

Choose a reason for hiding this comment

hiyouga Mar 14, 2025

Choose a reason for hiding this comment

hiyouga Mar 14, 2025

Choose a reason for hiding this comment

PLoic Mar 14, 2025

Choose a reason for hiding this comment