A text-to-speech generation model from Kling AI, supporting multilingual and multi-dialect synthesis . It generates online voiceovers from text descriptions or replicates specific voices through custom voice features. Supports speech speed adjustment (0.8-2x), multiple emotional style selections, and integrates with the lip-sync model to achieve audio-visual synchronized lip driving.
Request
Authorization
Header Params
Body Params application/jsonRequired
Example
{"text":" Welcome to Kling Lip Sync Model. Generate lip-synchronized videos based on face recognition results and audio input, maintaining character consistency and creating your desired audio for the character to lip-sync. This video was generated using the Kling Lip Sync Model.","voiceId":"genshin_klee2","voiceLanguage":"zh","voiceSpeed":1}
Request Code Samples
Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl--location--request POST 'https://www.runninghub.ai/openapi/v2/kling-lip-sync/tts' \
--header'Authorization: Bearer [Your API KEY]' \
--header'Authorization: Bearer [Your API KEY]' \
--header'Content-Type: application/json' \
--data-raw'{
"text": " Welcome to Kling Lip Sync Model. Generate lip-synchronized videos based on face recognition results and audio input, maintaining character consistency and creating your desired audio for the character to lip-sync. This video was generated using the Kling Lip Sync Model.",
"voiceId": "genshin_klee2",
"voiceLanguage": "zh",
"voiceSpeed": 1
}'