How to generate speech (TTS) in OpenAI

Inputs

The fields this action accepts.

Every field can be mapped from an upstream trigger, AI step, table row, or hard-coded literal.

Field	Key	Type	Required	Description
Text	input	string	Required	Text to convert to speech.
Model	model	options	Required	TTS model. Options: TTS-1, TTS-1 HD
Voice	voice	options	Required	Voice for the generated audio. Options: Alloy, Echo, Fable, Onyx, Nova, Shimmer
Format	response_format	options	Optional	Audio output format. Options: MP3, Opus, AAC, FLAC. Defaults to MP3.

Sample request

{
  "input": "{{trigger.input}}",
  "model": "{{trigger.model}}",
  "voice": "{{trigger.voice}}",
  "response_format": "{{trigger.response_format}}"
}
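As an illustration of how the four fields fit together, the following Python sketch validates the values and assembles the same request body with literal values instead of trigger variables. The helper name `build_tts_payload` is hypothetical, and the lowercase option identifiers (`tts-1`, `tts-1-hd`, `alloy`, and so on) are the values OpenAI's speech endpoint uses; whether this action expects those identifiers or the display labels from the table is an assumption.

```python
import json

# Allowed values, taken from the Inputs table. The lowercase identifiers are
# assumed to be what the action passes on to OpenAI.
MODELS = {"tts-1", "tts-1-hd"}
VOICES = {"alloy", "echo", "fable", "onyx", "nova", "shimmer"}
FORMATS = {"mp3", "opus", "aac", "flac"}


def build_tts_payload(text, model, voice, response_format=None):
    """Hypothetical helper: validate the fields and return the request body."""
    if not text:
        raise ValueError("input text is required")
    if model not in MODELS:
        raise ValueError(f"unknown model: {model!r}")
    if voice not in VOICES:
        raise ValueError(f"unknown voice: {voice!r}")
    payload = {"input": text, "model": model, "voice": voice}
    # response_format is optional; when omitted, the key is left out entirely.
    if response_format is not None:
        if response_format not in FORMATS:
            raise ValueError(f"unknown format: {response_format!r}")
        payload["response_format"] = response_format
    return payload


if __name__ == "__main__":
    body = build_tts_payload("Hello, world.", "tts-1", "nova", "mp3")
    print(json.dumps(body, indent=2))
```

Leaving `response_format` out of the body mirrors the table, where Format is the only optional field.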

Returns

{
  "success": true
}

Use these fields in downstream nodes for routing, logging, or error handling.

FAQ

Questions about Generate Speech (TTS).

What does the Generate Speech (TTS) action do in OpenAI?

It converts text to speech audio using OpenAI's TTS models and returns the audio bytes (MP3 by default). Six voices are supported: alloy, echo, fable, onyx, nova, and shimmer.

What inputs does Generate Speech (TTS) require?

Text, Model, and Voice are required; Format is optional. Every input accepts a static value or a variable from any upstream node in your workflow.

Can I use dynamic inputs from earlier workflow nodes?

Yes. Any field on this action can pull values from upstream nodes, whether that's a form response, a trigger payload, an AI output, or a lookup result.

What happens if OpenAI returns an error?

The workflow pauses on the failed node, the error message is captured in the run log, and you can retry the run with one click. Auto-retry policies are configurable per workflow with exponential backoff up to 5 attempts.
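For readers unfamiliar with the term, exponential backoff with a cap of 5 attempts can be sketched in Python as below. This is a generic illustration, not TinyCommand's implementation: the function name `retry_with_backoff`, the 1-second base delay, and the doubling factor are all assumptions.

```python
import time


def retry_with_backoff(action, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call action() until it succeeds or max_attempts calls have failed.

    The wait between attempts doubles each time:
    base_delay, 2 * base_delay, 4 * base_delay, ...
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return action()
        except Exception:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** (attempt - 1))


if __name__ == "__main__":
    calls = {"count": 0}

    def flaky():
        # Fails twice, then succeeds, to exercise the retry path.
        calls["count"] += 1
        if calls["count"] < 3:
            raise RuntimeError("temporary failure")
        return {"success": True}

    # A short base delay keeps the demo fast.
    print(retry_with_backoff(flaky, base_delay=0.01))
```

Doubling the delay gives a failing upstream service progressively more time to recover, while the attempt cap keeps a persistent failure from retrying forever.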

Does Generate Speech (TTS) support batch operations?

Yes. Run Generate Speech (TTS) inside a Loop node to process arrays. TinyCommand handles OpenAI's rate limits automatically so you don't have to throttle manually.
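Conceptually, a loop over an array produces one request per element, with the model and voice shared across all of them. The Python sketch below shows only that shape; `payloads_for_batch` is a hypothetical helper, the default model and voice are arbitrary choices, and the way a Loop node actually exposes the current item is not shown here.

```python
def payloads_for_batch(texts, model="tts-1", voice="alloy"):
    """Hypothetical helper: build one request body per array element."""
    return [{"input": text, "model": model, "voice": voice} for text in texts]


if __name__ == "__main__":
    lines = ["Welcome back.", "Your order has shipped.", "Goodbye."]
    for body in payloads_for_batch(lines, voice="nova"):
        print(body)
```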
