OpenAI

GPT-4o Mini TTS

Name: OpenAI GPT-4o Mini TTS
Author: OpenAI

Model ID: gpt-4o-mini-tts-2025-12-15

2025-12-15

GPT-4o Mini TTS is OpenAI's cost-efficient text-to-speech model, built on the GPT-4o Mini architecture. It converts written text into natural-sounding, expressive spoken audio and is highly steerable: developers can control speech characteristics such as tone, pacing, and emphasis through natural-language instructions in the prompt. Compared to previous OpenAI TTS models, it delivers significantly lower word error rates and more natural prosody, which makes it well suited to voice agents, accessibility features, and audio content production at scale.
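As a rough illustration of how such steering instructions might travel with a request, the sketch below only assembles a JSON body and sends nothing. The endpoint URL, the field names (`model`, `input`, `voice`, `instructions`), and the `alloy` voice reflect OpenAI's public speech API as understood here and are assumptions rather than details given on this page. The helper `build_speech_request` is hypothetical. Check the current API reference before relying on any of it.

```python
import json

# Assumed endpoint; not stated on this page. Shown for context only, never called.
SPEECH_URL = "https://api.openai.com/v1/audio/speech"
# Model ID as listed on this page.
MODEL_ID = "gpt-4o-mini-tts-2025-12-15"


def build_speech_request(text, instructions, voice="alloy"):
    """Return a JSON-serializable body for a text-to-speech request.

    Hypothetical helper: the field names are assumptions about the API.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    return {
        "model": MODEL_ID,
        "input": text,                 # the words to be spoken
        "voice": voice,                # assumed voice name
        "instructions": instructions,  # natural-language steering of tone and pacing
    }


payload = build_speech_request(
    "Your order has shipped and should arrive on Thursday.",
    "Speak in a warm, unhurried tone and stress the delivery day.",
)
print(json.dumps(payload, indent=2))
```

Keeping the spoken text and the delivery instructions in separate fields means the same sentence can be re-rendered in a different style by changing only `instructions`.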

API | Proprietary Model

Knowledge Cutoff

Unknown

The date this AI finished learning. It may not know about things that happened after this date.

Input → Output Format

Text → Audio

The types of content this AI can receive, and what it can produce in return.

Context Memory

The maximum amount of text the AI can read and process in a single request. A larger number means it can handle longer documents or conversations.

Cost/1M Tokens

$0.60 input / $12 output

The cost of using this AI directly in your own application. Shown in USD per 1 million units of text (tokens).
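The per-million pricing can be turned into a per-request estimate with simple arithmetic. The sketch below reads the listed prices as $0.60 per 1 million input tokens and $12 per 1 million output tokens. The function name `estimate_cost_usd` and the token counts in the example are made up for illustration, and actual billing may count tokens differently.

```python
# Prices as listed on this page, read as USD per 1 million tokens.
INPUT_USD_PER_MILLION = 0.60
OUTPUT_USD_PER_MILLION = 12.00


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the USD cost of a request from its token counts.

    Hypothetical helper; assumes simple linear per-token pricing.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


# Illustrative token counts, not measurements.
cost = estimate_cost_usd(input_tokens=2_000, output_tokens=50_000)
print(f"Estimated cost: ${cost:.4f}")
```

Because the output rate is twenty times the input rate, the length of the generated audio dominates the estimate for most requests.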