Echomimic_v2: Towards Striking, Simplified, and Semi-Body Human Animation

Inference for this example takes 8-10 minutes.