MiniMax Speech-02-Turbo is a high-performance TTS model built for speed and rhythmic precision. It maintains superior stability and rhythm while featuring enhanced multilingual capabilities, providing a seamless experience for global applications. With 17+ preset voices and support for custom voice cloning, it allows for highly personalized and emotionally resonant audio production. Its excellent performance ensures low-latency generation without sacrificing the natural, human-like intonation required for professional-grade content creation.
Request
Authorization
Header Params
Body Params application/jsonRequired
Example
{"text":"Hello! Welcome to the new era of high-speed synthesis! The rhythmic stability of Speech-02-Turbo remains unmatched even in complex scripts.","pronunciation_dict":["ASAP/As soon as possible"],"voice_id":"Energetic_Girl","speed":1,"volume":1,"pitch":0,"emotion":"happy","enable_base64_output":false,"english_normalization":false}
Request Code Samples
Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl--location--request POST 'https://www.runninghub.ai/openapi/v2/rhart-audio/text-to-audio/speech-02-turbo' \
--header'Authorization: Bearer [Your API KEY]' \
--header'Authorization: Bearer [Your API KEY]' \
--header'Content-Type: application/json' \
--data-raw'{
"text": "Hello! Welcome to the new era of high-speed synthesis! The rhythmic stability of Speech-02-Turbo remains unmatched even in complex scripts.",
"pronunciation_dict": [
"ASAP/As soon as possible"
],
"voice_id": "Energetic_Girl",
"speed": 1,
"volume": 1,
"pitch": 0,
"emotion": "happy",
"enable_base64_output": false,
"english_normalization": false
}'