ActionElevenLabsUpdated July 2026

How do I transcribe audio with ElevenLabs?

Short answer: You can speech to text in ElevenLabs by hand from its own interface, but it won’t repeat itself. On TinyCommand, add the ElevenLabs Speech to Text action to a workflow, map its 5 inputs from any upstream app, and it runs automatically every time the trigger fires. No code, and a free tier to start.

Speech to Text in ElevenLabs — start free All 7 ElevenLabs actions

Speech to Text in ElevenLabs — start free

Inputs

The fields this action accepts.

Every field can be mapped from an upstream trigger, AI step, table row, or hard-coded literal.

Field	Type	Required	Description
Audio File URL file_url	string	Required	Audio File URL (required)
Model model_id	options	Optional	Model. Options: Scribe v1
Language Code language_code	string	Optional	en (auto-detect if blank)
Speaker Diarization diarize	options	Optional	Speaker Diarization. Options: No, Yes
Timestamp Granularity timestamps_granularity	options	Optional	Timestamp Granularity. Options: None, Word, Character

Sample request

{
  "file_url": "e.g. https://example.com/path",
  "model_id": "{{trigger.model_id}}",
  "language_code": "en (auto-detect if blank)",
  "diarize": "{{trigger.diarize}}",
  "timestamps_granularity": "{{trigger.timestamps_granularity}}"
}

Returns

{
  "text": "Hello world",
  "words": [
    {
      "end": 0.4,
      "text": "Hello",
      "type": "word",
      "start": 0
    }
  ],
  "language_code": "en",
  "language_probability": 0.99
}

Use these fields in downstream nodes for routing, logging, or error handling.

Triggered by

Apps that pair well as the trigger for Speech to Text.

Any of these apps can fire this action as part of a workflow.

Google Sheets → ElevenLabs

2 Google Sheets triggers

HubSpot → ElevenLabs

18 HubSpot triggers

FAQ

Questions about Speech to Text.

What does the Speech to Text action do in ElevenLabs?

Transcribes audio using ElevenLabs' speech recognition. While ElevenLabs is better known for TTS, their STT is competitive with Deepgram/AssemblyAI for specific use cases. Useful for unified ElevenLabs-only voice-agent workflows.

What inputs does Speech to Text require?

Required: Audio File URL. Every input accepts a static value or a variable from any upstream node in your workflow.

Can I use dynamic inputs from earlier workflow nodes?

Yes. Any field on this action can pull values from upstream nodes, whether that's a form response, a trigger payload, an AI output, or a lookup result.

What happens if ElevenLabs returns an error?

The workflow pauses on the failed node, the error message is captured in the run log, and you can retry the run with one click. Auto-retry policies are configurable per workflow with exponential backoff up to 5 attempts.

Does Speech to Text support batch operations?

Yes. Run Speech to Text inside a Loop node to process arrays. TinyCommand handles ElevenLabs's rate limits automatically so you don't have to throttle manually.

More actions