-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mask guidance, inpaiting and outpaiting #49
Comments
The autoregreesive model acutally is not really good for mask guidance image generation in my opinion |
@daiyixiang666 theoretically i don't understand why it should perform badly, depends on how you are doing the conditioning. |
I think autoregreesive model perform better when align with language model |
If the mask is just like casual mask I think it will be great, but I do not think we always has the casual mask in real life |
@iFighting While it's true that autoregressive models can generate high-quality images without relying on text-based or mask-based conditioning, it's important to acknowledge that diffusion-based models like DiT have also demonstrated impressive results. However, diffusion models do face challenges when it comes to text-based and mask-based conditioning. |
For control generation (reference): https://arxiv.org/pdf/2406.09750 |
We are releasing code recently. If you are interested in controllable AR generation, please keep an eye on https://github.com/lxa9867/ControlVAR. |
Thanks for the awesome paper. Even the codebase is very easy to use.
Can you please do some initial experiments on mask guidance image generation, inpainting and outpainting. It will really help the community.
The text was updated successfully, but these errors were encountered: