Latest digital human MultiTalk: Image + Audio = High-quality digital human, supports singing lip-sync

D-Human

Image-to-Video

MultiTalk is a powerful new digital human model: it generates high-quality digital human videos from an image and an audio clip, with excellent results for both speech and singing.

Various optimizations in this workflow:

1: One-click prompt expansion by a large language model, for better generation quality

2: A maximum video side length can be set; the original width and height are kept if they do not exceed it

3: Supports intelligent audio clipping; a value of 0 uses the default length, which is the same as the original audio
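The side-length and audio-clipping options above can be sketched in a few lines of Python. This is a minimal illustration of the described behavior, not the workflow's actual node logic: the function names `fit_to_max_side` and `clip_duration` are hypothetical, and the sketch assumes plain proportional scaling with no rounding to a multiple that the model may require.

```python
from typing import Tuple


def fit_to_max_side(width: int, height: int, max_side: int = 832) -> Tuple[int, int]:
    """Hypothetical helper: shrink (width, height) so the longer side is at most max_side.

    If the image is already within the limit, the original size is returned unchanged.
    """
    longest = max(width, height)
    if longest <= max_side:
        return width, height
    # Scale both sides by the same factor to preserve the aspect ratio.
    scale = max_side / longest
    return max(1, round(width * scale)), max(1, round(height * scale))


def clip_duration(audio_seconds: float, requested_seconds: float = 0.0) -> float:
    """Hypothetical helper: choose how many seconds of audio to use.

    A request of 0 (assumed default) keeps the full audio length; a longer
    request than the audio itself is capped at the audio length.
    """
    if requested_seconds <= 0:
        return audio_seconds
    return min(requested_seconds, audio_seconds)


if __name__ == "__main__":
    print(fit_to_max_side(640, 480))    # within the limit, unchanged
    print(fit_to_max_side(1920, 1080))  # scaled down to fit 832
    print(clip_duration(30.0))          # 0 means full length
    print(clip_duration(30.0, 11.0))    # clipped to 11 seconds
```

With the default limit of 832, a 1920x1080 image would be reduced proportionally, while a 640x480 image passes through untouched.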

48 GB of VRAM is recommended for best results. The default side length is 832, and an 11-second video takes 7-10 minutes to process.

嘟嘟AI绘画趣味学

2025-06-21 Update

Nodes Information

Primitive Nodes (2)

CLIPVisionLoader

LoadImage

Custom Nodes (45)

Audio Duration (mtb)

AudioCrop

AudioSeparation

CR Text

DownloadAndLoadWav2VecModel

GetNode

ImageFromBatch+

ImpactSwitch

JWInteger

LayerUtility: ImageScaleByAspectRatio V2

LoadAudio

LoadWanVideoT5TextEncoder

MathExpression|pysssss

MultiTalkModelLoader

MultiTalkWav2VecEmbeds

Note

RHHiddenNodes

RH_LLMAPI_NODE

SetNode

VHS_DuplicateImages

VHS_VideoCombine

WanVideoApplyNAG

WanVideoBlockSwap

WanVideoClipVisionEncode

WanVideoContextOptions

WanVideoDecode

WanVideoEncode

WanVideoEnhanceAVideo

WanVideoExperimentalArgs

WanVideoImageToVideoEncode

WanVideoLoraSelect

WanVideoModelLoader

WanVideoSLG

WanVideoSampler

WanVideoTextEncodeSingle

WanVideoTorchCompileSettings

WanVideoUni3C_ControlnetLoader

WanVideoUni3C_embeds

WanVideoVAELoader

easy cleanGpuUsed

easy compare

easy ifElse

easy imageScaleDownToSize

easy imageSize

easy showAnything
