idu-Text-to-Video-q3-turbo is a pioneering "Drama-First" engine designed for industrial-grade content production. It is the first to achieve 16-second synchronized audio-visual output, allowing for complete narrative arcs within a single shot. Featuring a "Director's Mindset," it automatically manages camera cuts and synchronizes dialogue, ambient sounds, and emotional beats in real-time. The turbo version optimizes generation speed while maintaining cinematic fidelity, transforming AI video from a creative toy into a robust narrative productivity tool.
Request
Authorization
Header Params
Body Params application/jsonRequired
Example
{"prompt":"Visual: Wide shot: A Chinese sci-fi city at sunset with neon lights flickering on. Cut to medium shot: A young boy wearing smart glasses walking through a bustling night market. Cut to close-up: The boy stops, smiles at the camera, and clearly says, \"Welcome to Shanghai in 2077.\" Floating blue holographic Chinese text \"上海·未来之城\" (Shanghai: Future City) naturally appears on the right. Audio: Ambient crowd noise and low-frequency aircraft hum. The boy’s voice is energetic, with lips perfectly synced to the dialogue, ending with a crisp electronic chime.","style":"general","aspectRatio":"16:9","resolution":"720p","duration":"5","audio":true}
Request Code Samples
Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl--location'https://www.runninghub.ai/openapi/v2/vidu/text-to-video-q3-turbo' \
--header'Authorization: Bearer [Your API KEY]' \
--header'Authorization: Bearer [Your API KEY]' \
--header'Content-Type: application/json' \
--data'{
"prompt": "Visual: Wide shot: A Chinese sci-fi city at sunset with neon lights flickering on. Cut to medium shot: A young boy wearing smart glasses walking through a bustling night market. Cut to close-up: The boy stops, smiles at the camera, and clearly says, \"Welcome to Shanghai in 2077.\" Floating blue holographic Chinese text \"上海·未来之城\" (Shanghai: Future City) naturally appears on the right. Audio: Ambient crowd noise and low-frequency aircraft hum. The boy’s voice is energetic, with lips perfectly synced to the dialogue, ending with a crisp electronic chime.",
"style": "general",
"aspectRatio": "16:9",
"resolution": "720p",
"duration": "5",
"audio": true
}'