ActionDeepInfraUpdated July 2026

How do I call DeepInfra for LLM inference?

Short answer: You can deepinfra chat completion in DeepInfra by hand from its own interface, but it won’t repeat itself. On TinyCommand, add the DeepInfra DeepInfra Chat Completion action to a workflow, map its 5 inputs from any upstream app, and it runs automatically every time the trigger fires. No code, and a free tier to start.

DeepInfra Chat Completion in DeepInfra — start free All 3 DeepInfra actions

DeepInfra Chat Completion in DeepInfra — start free

Inputs

The fields this action accepts.

Every field can be mapped from an upstream trigger, AI step, table row, or hard-coded literal.

Field	Type	Required	Description
Model model	options	Required	Which model to use
User Message message	string	Required	Message sent to the model as user role
System Prompt system_prompt	string	Optional	Optional system instructions
Temperature temperature	string	Optional	0-2, higher = more random
Max Tokens max_tokens	string	Optional	Maximum tokens to generate

Sample request

{
  "model": "{{trigger.model}}",
  "message": "Your prompt",
  "system_prompt": "e.g. You are a helpful assistant",
  "temperature": "0.7",
  "max_tokens": "1024"
}

Returns

{
  "id": "chatcmpl_abc",
  "model": "meta-llama/Llama-3.3-70B-Instruct",
  "usage": {
    "total_tokens": 60,
    "prompt_tokens": 10,
    "completion_tokens": 50
  },
  "choices": [
    {
      "message": {
        "role": "assistant",
        "content": "Sample"
      },
      "finish_reason": "stop"
    }
  ]
}

Use these fields in downstream nodes for routing, logging, or error handling.

FAQ

Questions about DeepInfra Chat Completion.

What does the DeepInfra Chat Completion action do in DeepInfra?

Runs chat completion against DeepInfra-hosted open-source models (Llama, Mixtral, Qwen, DeepSeek) using OpenAI-compatible message-array shape. Competitive pricing for OSS inference at scale.

What inputs does DeepInfra Chat Completion require?

Required: Model, User Message. Every input accepts a static value or a variable from any upstream node in your workflow.

Can I use dynamic inputs from earlier workflow nodes?

Yes. Any field on this action can pull values from upstream nodes, whether that's a form response, a trigger payload, an AI output, or a lookup result.

What happens if DeepInfra returns an error?

The workflow pauses on the failed node, the error message is captured in the run log, and you can retry the run with one click. Auto-retry policies are configurable per workflow with exponential backoff up to 5 attempts.

Does DeepInfra Chat Completion support batch operations?

Yes. Run DeepInfra Chat Completion inside a Loop node to process arrays. TinyCommand handles DeepInfra's rate limits automatically so you don't have to throttle manually.

How do I call DeepInfra for LLM inference?

The fields this action accepts.

Apps that pair well as the trigger for DeepInfra Chat Completion.

Questions about DeepInfra Chat Completion.

Other DeepInfra actions.