MiniMax Speech 2.6 HD is a professional-grade TTS engine optimized for ultra-low latency and exceptional naturalness. Featuring a major normalization upgrade, it delivers crisp articulation and fluid rhythm across 40+ global languages, including specialized dialects. The model excels in maintaining cross-lingual similarity and accent fidelity, preserving "age" timbre and regional nuances with high precision. Designed for real-time streaming, it ensures seamless audio generation for live meetings and podcasts, creating an immersive, lifelike interactive experience.
Request
Authorization
Header Params
Body Params application/jsonRequired
Example
{"text":"我们的 Speech 2.6 HD 模型现已支持超过 40 种语言。 比如,当我说到“第 12,800 个并发节点”时,它的吐字依然如此清晰。归一化升级带来的高自然度,让即便是在吉隆坡 或特拉维夫 的听众,也能感受到家乡般的亲切。这种超低延时的表现,真是令人惊叹。","pronunciation_dict":["ASAP/As soon as possible"],"voice_id":"Wise_Woman","speed":1,"volume":1,"pitch":0,"emotion":"happy","enable_base64_output":false,"english_normalization":false}