Latte Performance

Chinese Version

Latte is a text-to-video model. xDiT currently accelerates its inference in parallel using USP (Unified Sequence Parallelism); PipeFusion support is under development.

On an 8xL20 (PCIe) machine, the latency for generating 512x512x16 videos is shown in the figure below.

[Figure: latency-latte-l20-512, Latte latency on 8xL20 for 512x512x16 videos]

The latency for generating 1024x1024x16 videos is shown in the figure below. Hybrid sequence parallelism (ulysses_degree=2, ring_degree=4) gives the lowest latency.

[Figure: latency-latte-l20-1024, Latte latency on 8xL20 for 1024x1024x16 videos]
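
To make the hybrid configuration above concrete, here is a minimal sketch that enumerates the (ulysses_degree, ring_degree) pairs available on a given number of GPUs. It assumes that, when only USP is used, the product of the two degrees equals the GPU count, which matches the 2 x 4 = 8 setting reported above. The helper `usp_configs` and its `num_heads` filter are hypothetical and not part of xDiT; the filter reflects the usual Ulysses requirement that the number of attention heads be divisible by the Ulysses degree.

```python
from typing import List, Optional, Tuple


def usp_configs(world_size: int, num_heads: Optional[int] = None) -> List[Tuple[int, int]]:
    """Enumerate (ulysses_degree, ring_degree) pairs for USP.

    Assumption: ulysses_degree * ring_degree == world_size when USP is the
    only parallel method in use. If num_heads is given, pairs whose
    ulysses_degree does not divide the head count are dropped.
    """
    if world_size < 1:
        raise ValueError("world_size must be a positive integer")

    configs = []
    for ulysses_degree in range(1, world_size + 1):
        if world_size % ulysses_degree != 0:
            continue  # the two degrees must factor world_size exactly
        if num_heads is not None and num_heads % ulysses_degree != 0:
            continue  # Ulysses splits attention heads across ranks
        configs.append((ulysses_degree, world_size // ulysses_degree))
    return configs


if __name__ == "__main__":
    # Candidate settings for an 8-GPU machine such as the 8xL20 used above.
    for ulysses_degree, ring_degree in usp_configs(8):
        print(f"ulysses_degree={ulysses_degree}, ring_degree={ring_degree}")
```

Which pair is fastest depends on the interconnect and the sequence length, so each candidate still has to be benchmarked; the sketch only lists the settings worth trying.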