MMAudio is a very powerful and practical dubbing workflow.

Audio Generation

MMAudio generates synchronized audio from video and/or text inputs. Its key innovation is multimodal joint training, which allows training on a wide range of audio-visual and audio-text datasets. In addition, a synchronization module aligns the generated audio with the video frames.

Folix

2025-01-10 Update

Workflow introduction

Nodes Information

Custom Nodes (7)

MMAudioFeatureUtilsLoader

MMAudioModelLoader

MMAudioSampler

PreviewAudio

VHS_LoadVideo

VHS_VideoCombine

VHS_VideoInfo
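
As a rough illustration of how the seven nodes listed above might fit together, the sketch below models the workflow as a dependency graph and derives one valid execution order. The node names come from the list; the connections between them are an assumption made for illustration and are not taken from the workflow file itself. No ComfyUI API is called here; this is plain Python using the standard library's graphlib module.

```python
from graphlib import TopologicalSorter

# Assumed wiring (hypothetical): each node maps to the nodes it reads from.
# Only the node names are from the workflow page; the edges are a guess at
# a typical video-to-audio layout.
ASSUMED_INPUTS = {
    "VHS_LoadVideo": set(),
    "VHS_VideoInfo": {"VHS_LoadVideo"},
    "MMAudioModelLoader": set(),
    "MMAudioFeatureUtilsLoader": set(),
    "MMAudioSampler": {
        "VHS_LoadVideo",
        "VHS_VideoInfo",
        "MMAudioModelLoader",
        "MMAudioFeatureUtilsLoader",
    },
    "PreviewAudio": {"MMAudioSampler"},
    "VHS_VideoCombine": {"VHS_LoadVideo", "MMAudioSampler"},
}


def execution_order(inputs):
    """Return the nodes in an order where every node follows its inputs."""
    return list(TopologicalSorter(inputs).static_order())


order = execution_order(ASSUMED_INPUTS)
print(" -> ".join(order))
```

Under this assumed wiring, the loaders and the video input run first, the sampler runs once all of its inputs are available, and the preview and video-combine nodes run last.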
