- Integrations
- /
- OpenAI
- /
- Actions
- /
- Transcribe Audio (Whisper)
ActionOpenAIUpdated May 2026
How do I transcribe audio with OpenAI Whisper?
Short answer: Drop the "OpenAI → Transcribe Audio (Whisper)" action anywhere in your workflow, map the inputs from upstream nodes, and publish.
Inputs
The fields this action accepts.
Every field can be mapped from an upstream trigger, AI step, table row, or hard-coded literal.
| Field | Type | Required | Description |
|---|---|---|---|
Audio File URL file_url | string | Required | Public URL of the audio file to transcribe (mp3, mp4, mpeg, mpga, m4a, wav, webm) |
Model model | options | Required | Model. Options: Whisper v1 |
Language language | options | Optional | Language of the audio (improves accuracy). Leave empty for auto-detect. |
Prompt prompt | string | Optional | Optional text to guide the model's style or continue a previous segment |
Response Format response_format | options | Optional | Response Format. Options: JSON, Plain Text, SRT (subtitles), VTT (subtitles), Verbose JSON (with timestamps) |
Sample request
{"file_url": "https://example.com/audio.mp3","model": "{{trigger.model}}","language": "{{trigger.language}}","prompt": "e.g. Technical discussion about cloud computing","response_format": "{{trigger.response_format}}"}
Returns
{"text": "Hello, this is a sample transcription of the audio file."}
Use these fields in downstream nodes for routing, logging, or error handling.
Triggered by
Apps that pair well as the trigger for Transcribe Audio (Whisper).
Any of these apps can fire this action as part of a workflow.
FAQ
Questions about Transcribe Audio (Whisper).
What does the Transcribe Audio (Whisper) action do in OpenAI?
Transcribes audio into text using OpenAI Whisper. Supports mp3, mp4, mpeg, mpga, m4a, wav, and webm; up to 25 MB per file. Also supports translation to English.
What inputs does Transcribe Audio (Whisper) require?
Required: Audio File URL, Model. Every input accepts a static value or a variable from any upstream node in your workflow.
Can I use dynamic inputs from earlier workflow nodes?
Yes. Any field on this action can pull values from upstream nodes, whether that's a form response, a trigger payload, an AI output, or a lookup result.
What happens if OpenAI returns an error?
The workflow pauses on the failed node, the error message is captured in the run log, and you can retry the run with one click. Auto-retry policies are configurable per workflow with exponential backoff up to 5 attempts.
Does Transcribe Audio (Whisper) support batch operations?
Yes. Run Transcribe Audio (Whisper) inside a Loop node to process arrays. Tiny Command handles OpenAI's rate limits automatically so you don't have to throttle manually.
More actions
Other OpenAI actions.
Action
Analyze Image
Analyzes an image using GPT-4o vision, accepting a URL or base64-encoded image plus a text prompt. Use for OCR, chart extraction, alt-text generation, or visual QA.
ActionChat Completion
Sends a message and gets a response from an OpenAI chat model (gpt-4o, gpt-4o-mini, o-series, etc.). The standard chat-completion action; supports system prompts, temperature, and stop sequences.
ActionChat with Tools
Chat completion with tool/function calling enabled. The model may return tool_calls instead of (or in addition to) text; you execute them and feed results back as tool messages for the next turn.
ActionCreate Batch
Submits a batch of OpenAI requests at a 50% cost discount with a 24-hour SLA. Requires an uploaded JSONL file id with one request per line. Perfect for bulk classification or embedding jobs.
ActionCreate Embedding
Generates vector embeddings for text using text-embedding-3-small or text-embedding-3-large. The de-facto embeddings default for RAG and semantic search.
ActionCreate Image (DALL-E)
Generates images from a text prompt using DALL-E (or gpt-image-1). Supports size, quality, and style controls; returns a URL or base64-encoded image.
Send transcribe audio (whisper) from your workflows.
Triggered by anything in the catalog. Free tier available. No credit card.