Skip to main content
MindStudio
Pricing
Blog About
My Workspace
Text to Speech Model

GPT-4o-mini TTS

A lightweight, cost-efficient text-to-speech model from OpenAI that converts written text into natural-sounding audio.

Publisher OpenAI
Type Text to Speech
Context Window 2,000 tokens
Training Data n/a
Price Free/1K chars
LATEST

Lightweight text-to-speech audio generation by OpenAI

GPT-4o-mini TTS is a text-to-speech model developed by OpenAI that converts written text into natural-sounding spoken audio. It belongs to the GPT-4o mini family, which is designed to deliver capable output at a smaller computational footprint than full-scale variants. The model is accessible to developers through the OpenAI API and is intended for programmatic speech generation across a range of applications. It accepts a text input of up to 2,000 characters and returns audio output in a synthesized voice.

GPT-4o-mini TTS is part of OpenAI's broader suite of audio models, which also includes transcription and speech-to-speech capabilities. Its focus is specifically on the text-to-speech task, producing clear and expressive spoken output from plain text. The model is well-suited for voice-enabled applications, accessibility tools, content narration, and any product that requires reliable, scalable audio generation without requiring a larger model. Developers can configure voice selection and other parameters through the API.

What GPT-4o-mini TTS supports

Text to Speech

Converts written text input into synthesized spoken audio output. Accepts up to 2,000 characters of text per request.

Voice Selection

Allows developers to choose from available voice options via a select input parameter in the API request.

Expressive Audio Output

Produces natural-sounding speech with expressive qualities, suitable for narration and voice-enabled interfaces.

API Integration

Available through the OpenAI API, enabling developers to integrate speech generation directly into applications via standard API calls.

Configurable Parameters

Supports numeric and select input types, allowing developers to adjust settings such as speed or format alongside voice and prompt inputs.

Ready to build with GPT-4o-mini TTS?

Get Started Free

Common questions about GPT-4o-mini TTS

What is the input limit for GPT-4o-mini TTS?

The model supports a context window of 2,000 characters, which represents the maximum amount of text that can be submitted in a single request.

How is GPT-4o-mini TTS accessed?

The model is available through the OpenAI API using the model ID gpt-4o-mini-tts. Developers can integrate it into applications by making API calls with a text prompt and optional configuration parameters.

What input types does GPT-4o-mini TTS accept?

The model accepts a text prompt, select inputs for options such as voice choice, and a numeric input for parameters like speed, based on the available input types listed in the metadata.

Does GPT-4o-mini TTS have a training data cutoff date?

A specific training date is not provided for this model. As a text-to-speech model, it does not rely on a knowledge cutoff in the same way that language models do, since it converts text to audio rather than generating factual content.

What types of applications is GPT-4o-mini TTS suited for?

The model is designed for use cases that require programmatic speech generation, including voice-enabled applications, accessibility tools, content narration systems, and other products that need scalable audio output from text input.

What people think about GPT-4o-mini TTS

Community discussion directly about GPT-4o-mini TTS is limited in the found threads, though one Reddit post notes that OpenAI quietly released updated versions of their TTS and related audio models in the API in December 2025, suggesting ongoing development. General sentiment around OpenAI's audio model updates tends to be positive, with developers noting the availability of new versioned releases.

The threads found are largely focused on other TTS models or broad AI release roundups rather than GPT-4o-mini TTS specifically, so detailed community feedback on its limitations or use cases is not well represented in this sample. Developers interested in community discussion may find more targeted feedback in OpenAI developer forums or the OpenAI API changelog.

View more discussions →

Parameters & options

Voice Select

Voice to use in TTS

Default: alloy
AlloyAshCoralEchoFableOnyxNovaSageShimmer
Instructions Prompt

Control the voice of your generated audio with additional instructions.

Speed Number

The speed of the generated audio. (Default is 1.0)

Default: 1 Range: 0.25–4 (step 0.05)
Output Format Select
Default: mp3
MP3WAV

Start building with GPT-4o-mini TTS

No API keys required. Create AI-powered workflows with GPT-4o-mini TTS in minutes — free.