Simple LTX2 I2V workflow.

The first pass uses res2s with the fp8 undistilled model and the distilled LoRA at reduced strength, usually 0.75-0.9 depending on the application (rank175 seems best). Stack as many LoRAs as you can, even if they are barely related to the concept: the LoRA stack seems to drown out the base model noise and makes the output more stable.

Set the compression strength on the preprocess node to 33-38 to increase motion. Get the best motion and audio you can out of the first pass: if a run looks bad, cancel it and reroll the seed. If the preview looks kind of good, stop it and refine with prompt changes and different LoRA weight mixes.

The second pass is a half-strength distilled upscale pass based on the full-size input image for max quality, and it reuses the audio directly from the first pass. Half distill is the only way I have found to get very good, clear visuals, but passing the audio latent into the half-distilled sampler ruins it, so carrying the first-pass audio over untouched is the neat trick.
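
For quick reference, here are the settings from the notes above written out as plain Python data, plus a tiny range check. This is only a sketch: the key names and the `check_first_pass` helper are made up for illustration and are not ComfyUI node inputs, and reading "half strength" as 0.5 is an assumption.

```python
# Hypothetical summary of the two-pass settings described in these notes.
# Key names are illustrative only; they do not match real ComfyUI node inputs.
FIRST_PASS = {
    "sampler": "res2s",
    "base_model": "fp8 undistilled",
    "distilled_lora_strength": (0.75, 0.9),  # usual range from the notes
    "preprocess_compression": (33, 38),      # range suggested for more motion
    "outputs": ["video", "audio"],
}

SECOND_PASS = {
    "distill_strength": 0.5,  # assumption: "half strength" read as 0.5
    "image_source": "full-size input image",
    "audio_source": "first pass",
    "audio_latent_into_sampler": False,  # the notes say this ruins the result
}


def in_range(value, bounds):
    """Return True if value lies inside the inclusive (low, high) bounds."""
    low, high = bounds
    return low <= value <= high


def check_first_pass(lora_strength, compression):
    """List which first-pass values fall outside the ranges in the notes."""
    problems = []
    if not in_range(lora_strength, FIRST_PASS["distilled_lora_strength"]):
        problems.append("distilled LoRA strength outside 0.75-0.9")
    if not in_range(compression, FIRST_PASS["preprocess_compression"]):
        problems.append("preprocess compression outside 33-38")
    return problems


if __name__ == "__main__":
    print(check_first_pass(0.8, 35))
    print(check_first_pass(1.0, 20))
```

The ranges are starting points taken from the notes, not hard limits, so treat a reported problem as a reminder rather than an error.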

If you want to use this for T2V, just bypass the LTXVImgToVideoInplace node.
