Load a video longer than 10 seconds
Record audio
Generate a lip-synced video
To adjust more detailed parameters, open the workflow and edit them there

Seed: Random seed for reproducible results (default: 1247)
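
A minimal sketch of why a fixed seed gives reproducible results. It uses Python's standard random module as a stand-in for the model's noise generator; the sample_noise helper is hypothetical and is not part of this workflow.

```python
import random

def sample_noise(seed, n=4):
    # Hypothetical helper: draw n Gaussian values from a generator
    # seeded with `seed`. A real pipeline would seed its own
    # tensor library instead; this only illustrates the idea.
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]

same_a = sample_noise(1247)   # default seed from the text above
same_b = sample_noise(1247)   # same seed, so the same starting noise
other = sample_noise(1248)    # different seed, different starting noise

print(same_a == same_b)
print(same_a == other)
```

Reusing the seed reproduces a result only if the input video, audio, and the other parameters are also unchanged.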

Lip Expression: Controls the vividness of lip movements (default: 1.5)
High values (2.0, 3.0): More pronounced lip movements, better suited for expressive speech
Low values (1.0, 1.5): More subtle lip movements, better suited for calm speech
This parameter affects the model's guidance scale, balancing between natural motion and lip-sync accuracy
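
As a rough illustration of what a guidance scale does, the sketch below assumes the common classifier-free guidance formulation, in which the final prediction is pushed from an unconditioned prediction toward an audio-conditioned one. Whether this workflow uses exactly this formula is an assumption; the function name and the toy numbers are hypothetical.

```python
def apply_guidance(uncond, cond, scale):
    # Assumed classifier-free guidance rule:
    #   guided = uncond + scale * (cond - uncond)
    # scale = 1.0 returns the conditioned prediction unchanged;
    # larger values exaggerate the effect of the conditioning.
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]

uncond = [0.0, 1.0]   # toy prediction without audio conditioning
cond = [1.0, 3.0]     # toy prediction with audio conditioning

subtle = apply_guidance(uncond, cond, 1.0)
default = apply_guidance(uncond, cond, 1.5)
strong = apply_guidance(uncond, cond, 3.0)
print(subtle, default, strong)
```

Under this assumption, raising Lip Expression moves the output further along the direction the audio suggests, which matches the description of more pronounced lip movements at higher values.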

Inference Steps: Number of denoising steps during the inference process (default: 20)
High values (30, 50): Better quality results but slower processing speed
Low values (10, 15): Faster processing speed but potentially lower quality
The default value of 20 usually offers a good balance between quality and speed
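
The three parameters above can be collected into a small settings object. This is a sketch only: the LipSyncSettings class and its field names are hypothetical, the defaults are the ones listed above, and the validation bounds are assumptions rather than limits stated by the workflow.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class LipSyncSettings:
    # Defaults taken from the parameter descriptions above.
    seed: int = 1247
    lip_expression: float = 1.5
    inference_steps: int = 20

    def __post_init__(self):
        # Assumed sanity checks; the workflow may enforce other limits.
        if self.lip_expression <= 0:
            raise ValueError("lip_expression must be positive")
        if self.inference_steps < 1:
            raise ValueError("inference_steps must be at least 1")

# Calm speech: subtler lips, fewer steps for faster processing.
calm_fast = LipSyncSettings(lip_expression=1.0, inference_steps=10)

# Expressive speech: stronger lips, more steps for better quality.
expressive_hq = LipSyncSettings(lip_expression=2.0, inference_steps=30)

print(LipSyncSettings())
print(calm_fast)
print(expressive_hq)
```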