Action · Cartesia · Updated May 2026

How do I generate speech with Cartesia?

Short answer: Drop the "Cartesia Sonic TTS" action anywhere in your workflow, map the inputs from upstream nodes, and publish.

Inputs

The fields this action accepts.

Every field can be mapped from an upstream trigger, AI step, table row, or hard-coded literal.

| Field | Type | Required | Description |
|---|---|---|---|
| Text (transcript) | string | Required | Text (required) |
| Voice ID (voice_id) | string | Required | Voice ID (required) |
| Model (model_id) | options | Optional | Model. Options: Sonic 2, Sonic |
| Language (language) | string | Optional | Language |

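The required and optional fields above can be checked before a workflow runs. The following is a minimal Python sketch of that check, not Tiny Command's own validation. The function name `build_inputs` is hypothetical, and it assumes `model_id` takes the option labels exactly as listed in the table, which may differ from the identifiers the action actually expects.

```python
from typing import Optional

# Option labels copied from the inputs table; treating them as the literal
# model_id values is an assumption made for this sketch.
MODEL_OPTIONS = {"Sonic 2", "Sonic"}


def build_inputs(transcript: str, voice_id: str,
                 model_id: Optional[str] = None,
                 language: Optional[str] = None) -> dict:
    """Return a dict of action inputs, rejecting missing required fields."""
    if not transcript or not transcript.strip():
        raise ValueError("transcript is required")
    if not voice_id or not voice_id.strip():
        raise ValueError("voice_id is required")
    if model_id is not None and model_id not in MODEL_OPTIONS:
        raise ValueError(f"model_id must be one of {sorted(MODEL_OPTIONS)}")

    inputs = {"transcript": transcript, "voice_id": voice_id}
    # Optional fields are omitted entirely when no value is supplied.
    if model_id is not None:
        inputs["model_id"] = model_id
    if language is not None:
        inputs["language"] = language
    return inputs
```

Calling `build_inputs("Hello", "my-voice")` returns only the two required keys, while an empty transcript raises `ValueError` before anything reaches the action.
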
Sample request
{
  "transcript": "{{trigger.transcript}}",
  "voice_id": "{{trigger.voice_id}}",
  "model_id": "{{trigger.model_id}}",
  "language": "{{trigger.language}}"
}
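
To illustrate how `{{trigger.…}}` placeholders like those in the sample request could be filled from an upstream payload, here is a rough Python sketch. It is an illustration of the mapping idea only: the `resolve` helper, the regular expression, and the shape of the `context` dict are assumptions, not the platform's actual template engine.

```python
import re

# Matches placeholders such as {{trigger.transcript}} (assumed syntax,
# inferred from the sample request).
PLACEHOLDER = re.compile(r"\{\{\s*([\w.]+)\s*\}\}")


def resolve(template: dict, context: dict) -> dict:
    """Replace each {{a.b}} placeholder with the value at context["a"]["b"]."""

    def lookup(path: str) -> str:
        value = context
        for key in path.split("."):
            value = value[key]  # raises KeyError if the upstream value is missing
        return str(value)

    return {
        field: PLACEHOLDER.sub(lambda match: lookup(match.group(1)), text)
        for field, text in template.items()
    }


template = {
    "transcript": "{{trigger.transcript}}",
    "voice_id": "{{trigger.voice_id}}",
    "language": "en",  # hard-coded literals pass through unchanged
}
context = {"trigger": {"transcript": "Hello there", "voice_id": "example-voice"}}
resolved = resolve(template, context)
```

In this sketch a missing upstream value fails loudly with `KeyError` rather than sending an unresolved placeholder string as the text to speak.
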
Returns
"<binary>"

Use this output in downstream nodes for routing, logging, or error handling.

Triggered by

Apps that pair well as the trigger for Cartesia Sonic TTS.

Any of these apps can fire this action as part of a workflow.

FAQ

Questions about Cartesia Sonic TTS.

What does the Cartesia Sonic TTS action do in Cartesia?
Generates audio from text using Cartesia's Sonic model — ultra-low latency (sub-200ms typical), optimised for real-time voice agents. For broadcast-quality narration where peak quality matters more than speed, use ElevenLabs instead.
What inputs does Cartesia Sonic TTS require?
Required: Text, Voice ID. Every input accepts a static value or a variable from any upstream node in your workflow.
Can I use dynamic inputs from earlier workflow nodes?
Yes. Any field on this action can pull values from upstream nodes, whether that's a form response, a trigger payload, an AI output, or a lookup result.
What happens if Cartesia returns an error?
The workflow pauses on the failed node, the error message is captured in the run log, and you can retry the run with one click. Auto-retry policies are configurable per workflow with exponential backoff up to 5 attempts.
Does Cartesia Sonic TTS support batch operations?
Yes. Run Cartesia Sonic TTS inside a Loop node to process arrays. Tiny Command handles Cartesia's rate limits automatically so you don't have to throttle manually.
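
The retry behaviour mentioned in the FAQ (exponential backoff, up to 5 attempts) follows a common pattern, sketched below in Python for readers who want to see the idea concretely. This is not Tiny Command's implementation: the `retry_with_backoff` name, the 1-second base delay, and the doubling factor are assumptions chosen for illustration.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(step: Callable[[], T],
                       max_attempts: int = 5,
                       base_delay: float = 1.0,
                       sleep: Callable[[float], None] = time.sleep) -> T:
    """Call step() until it succeeds, doubling the wait after each failure."""
    for attempt in range(1, max_attempts + 1):
        try:
            return step()
        except Exception:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            # Waits base_delay, then 2x, 4x, 8x before the following attempts.
            sleep(base_delay * 2 ** (attempt - 1))
    raise ValueError("max_attempts must be at least 1")
```

The `sleep` parameter is injectable so the function can be exercised without real waiting, for example by passing `delays.append` to record the schedule instead of pausing.
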

Run Cartesia Sonic TTS from your workflows.

Triggered by anything in the catalog. Free tier available. No credit card.