Qwen3 TTS Two-Person/Single-Person Dialogue Audio Generation (Supports Voice Cloning from an Uploaded Sample or Voice Generation from a Text Description)

Each character has two voice options to choose from,【Generate voice based on text description】or【Use uploaded voice】:
  • Option 1: Generate a voice from your text description, then have it speak the lines you provide
  • Option 2: Clone the voice from the audio you upload, then have it speak the lines you provide


AI application:

  • In the AI application, select the voice option from the dropdown. Once one option is chosen, the parameters for the other option have no effect and can be ignored.


Workflow:

  • In the workflow, use the pink switch to choose between the two options;
  • The fluorescent green nodes are the input parameters. Adjust them as needed; the other nodes do not need to be modified.


Generation takes less than one minute, and single-person speech is also supported.
The final output is a single audio clip.

Note: The character names used in the full dialogue script must match the names you assign to each character.
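
To catch a name mismatch before running the workflow, the script can be checked locally. The sketch below is only an illustration and is not part of the workflow: it assumes each line of the script is written as "Name: spoken text" (the actual format the workflow expects is not stated here), and the helper name find_unknown_speakers is hypothetical.

```python
def find_unknown_speakers(script: str, assigned_names: list[str]) -> list[str]:
    """Return speaker labels in the script that match no assigned character name."""
    known = {name.strip() for name in assigned_names}
    unknown = []
    for line in script.splitlines():
        line = line.strip()
        if not line:
            continue
        # Assumed line format: "<name>: <spoken text>".
        # A full-width colon is normalized to an ASCII colon first.
        speaker, sep, _ = line.replace("：", ":", 1).partition(":")
        if not sep:
            continue  # no speaker label on this line, nothing to check
        speaker = speaker.strip()
        if speaker not in known and speaker not in unknown:
            unknown.append(speaker)
    return unknown


if __name__ == "__main__":
    script = "Alice: Hi there.\nBob: Hello!\nAlce: This label has a typo."
    print(find_unknown_speakers(script, ["Alice", "Bob"]))  # ['Alce']
```

An empty result means every speaker label in the script corresponds to an assigned character name; any name that is returned should be corrected in either the script or the character name field.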


👇Qwen3 TTS Dialogue Audio Generation for 8 or fewer people (Supports Uploading Voice Cloning or Generating Voice Based on Text Description)
https://www.runninghub.cn/post/2017578327875264513?inviteCode=ishbfzc1


👇(Used in combination) LTX2.0 First and Last Frame Video V3 Audio-Driven Version_Three-Time Sampling (Upload audio independently)

https://www.runninghub.cn/post/2017697617920135169?inviteCode=ishbfzc1