- Integrations
- /
- DeepInfra
- /
- Actions
- /
- DeepInfra Chat Completion
ActionDeepInfraUpdated May 2026
How do I call DeepInfra for LLM inference?
Short answer: Drop the "DeepInfra → DeepInfra Chat Completion" action anywhere in your workflow, map the inputs from upstream nodes, and publish.
Inputs
The fields this action accepts.
Every field can be mapped from an upstream trigger, AI step, table row, or hard-coded literal.
| Field | Type | Required | Description |
|---|---|---|---|
Model model | options | Required | Which model to use |
User Message message | string | Required | Message sent to the model as user role |
System Prompt system_prompt | string | Optional | Optional system instructions |
Temperature temperature | string | Optional | 0-2, higher = more random |
Max Tokens max_tokens | string | Optional | Maximum tokens to generate |
Sample request
{"model": "{{trigger.model}}","message": "Your prompt","system_prompt": "e.g. You are a helpful assistant","temperature": "0.7","max_tokens": "1024"}
Returns
{"id": "chatcmpl_abc","model": "meta-llama/Llama-3.3-70B-Instruct","usage": {"total_tokens": 60,"prompt_tokens": 10,"completion_tokens": 50},"choices": [{"message": {"role": "assistant","content": "Sample"},"finish_reason": "stop"}]}
Use these fields in downstream nodes for routing, logging, or error handling.
Triggered by
Apps that pair well as the trigger for DeepInfra Chat Completion.
Any of these apps can fire this action as part of a workflow.
FAQ
Questions about DeepInfra Chat Completion.
What does the DeepInfra Chat Completion action do in DeepInfra?
Runs chat completion against DeepInfra-hosted open-source models (Llama, Mixtral, Qwen, DeepSeek) using OpenAI-compatible message-array shape. Competitive pricing for OSS inference at scale.
What inputs does DeepInfra Chat Completion require?
Required: Model, User Message. Every input accepts a static value or a variable from any upstream node in your workflow.
Can I use dynamic inputs from earlier workflow nodes?
Yes. Any field on this action can pull values from upstream nodes, whether that's a form response, a trigger payload, an AI output, or a lookup result.
What happens if DeepInfra returns an error?
The workflow pauses on the failed node, the error message is captured in the run log, and you can retry the run with one click. Auto-retry policies are configurable per workflow with exponential backoff up to 5 attempts.
Does DeepInfra Chat Completion support batch operations?
Yes. Run DeepInfra Chat Completion inside a Loop node to process arrays. Tiny Command handles DeepInfra's rate limits automatically so you don't have to throttle manually.
More actions
Other DeepInfra actions.
Action
DeepInfra Embeddings
Generates embeddings from DeepInfra-hosted models (BGE, sentence-transformers). For RAG pipeline vector generation on a budget-friendly OSS inference provider.
ActionList DeepInfra Models
Returns the current DeepInfra model catalog with pricing per model. Useful for model-selection workflows and for per-model cost calculations.
Send deepinfra chat completion from your workflows.
Triggered by anything in the catalog. Free tier available. No credit card.