feat: Classifer-Free Guidance (take 2) #835

AlpinDale · 2024-11-25T18:35:38Z

Redoing #651

Example usage:

from typing import List
from aphrodite import LLM, SamplingParams
from aphrodite.inputs import PromptInputs

llm = LLM(
    model="NousResearch/Meta-Llama-3.1-8B-Instruct",
    use_v2_block_manager=True,
    cfg_model="NousResearch/Meta-Llama-3.1-8B-Instruct"
)

prompt_pairs = [
    {
        "prompt": "Hello, my name is",
        "negative_prompt": "I am uncertain and confused about who I am"
    },
    {
        "prompt": "The president of the United States is",
        "negative_prompt": "I don't know anything about US politics or leadership"
    },
]

tokenizer = llm.get_tokenizer()

inputs: List[PromptInputs] = [
    {
        "prompt_token_ids": tokenizer.encode(text=pair["prompt"]),
        "negative_prompt_token_ids": tokenizer.encode(text=pair["negative_prompt"])
    }
    for pair in prompt_pairs
]

sampling_params = SamplingParams(guidance_scale=5.0)
outputs = llm.generate(inputs, sampling_params)

TODO:

See if we can skip the double forward pass for the model
Pipe into OpenAI API

AlpinDale · 2024-11-25T22:16:35Z

It works, but will fail due to changed logit shapes when running without CFG.

AlpinDale added 5 commits November 25, 2024 18:22

feat: classifier free guidance - take 2

2242c38

fix

2242975

clean-up and example script

2242cf6

formatting

22425c1

guard against using block manager v1

22425a0

AlpinDale marked this pull request as draft November 25, 2024 22:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Classifer-Free Guidance (take 2) #835

feat: Classifer-Free Guidance (take 2) #835

AlpinDale commented Nov 25, 2024 •

edited

Loading

AlpinDale commented Nov 25, 2024

feat: Classifer-Free Guidance (take 2) #835

Are you sure you want to change the base?

feat: Classifer-Free Guidance (take 2) #835

Conversation

AlpinDale commented Nov 25, 2024 • edited Loading

AlpinDale commented Nov 25, 2024

AlpinDale commented Nov 25, 2024 •

edited

Loading