Simply upload an image and an audio clip to generate a cinematic and dynamic video. Currently, the English version delivers the best results.