ActionTogether AIUpdated July 2026

How do I call an open-source LLM through Together AI?

Short answer: You can chat completion in Together AI by hand from its own interface, but it won’t repeat itself. On TinyCommand, add the Together AI Chat Completion action to a workflow, map its 7 inputs from any upstream app, and it runs automatically every time the trigger fires. No code, and a free tier to start.

Chat Completion in Together AI — start free All 3 Together AI actions

Chat Completion in Together AI — start free

Inputs

The fields this action accepts.

Every field can be mapped from an upstream trigger, AI step, table row, or hard-coded literal.

Field	Type	Required	Description
Model model	options	Required	Which model to use
User Message message	string	Required	User message to send to the model
System Prompt system_prompt	string	Optional	Optional system instructions that shape the model's behavior
Temperature temperature	string	Optional	Sampling temperature (0–2). Higher = more random.
Max Tokens max_tokens	string	Optional	Maximum tokens to generate in the response
Top P top_p	string	Optional	Nucleus sampling threshold (0–1)
Response Format response_format	options	Optional	Force JSON output (model must support JSON mode)

Sample request

{
  "model": "{{trigger.model}}",
  "message": "e.g. Summarize this article in 3 bullets",
  "system_prompt": "e.g. You are a helpful assistant.",
  "temperature": "0.7",
  "max_tokens": "1024"
}

Returns

{
  "id": "chatcmpl-abc123",
  "model": "meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8",
  "usage": {
    "total_tokens": 60,
    "prompt_tokens": 10,
    "completion_tokens": 50
  },
  "choices": [
    {
      "message": {
        "role": "assistant",
        "content": "Sample response"
      },
      "finish_reason": "stop"
    }
  ]
}

Use these fields in downstream nodes for routing, logging, or error handling.

FAQ

Questions about Chat Completion.

What does the Chat Completion action do in Together AI?

Runs an open-weight chat model (Llama 4, DeepSeek-V3, Qwen, Mixtral, etc.) on Together AI's fast inference platform. OpenAI-compatible request shape so existing chat-completion code mostly works as is.

What inputs does Chat Completion require?

Required: Model, User Message. Every input accepts a static value or a variable from any upstream node in your workflow.

Can I use dynamic inputs from earlier workflow nodes?

Yes. Any field on this action can pull values from upstream nodes, whether that's a form response, a trigger payload, an AI output, or a lookup result.

What happens if Together AI returns an error?

The workflow pauses on the failed node, the error message is captured in the run log, and you can retry the run with one click. Auto-retry policies are configurable per workflow with exponential backoff up to 5 attempts.

Does Chat Completion support batch operations?

Yes. Run Chat Completion inside a Loop node to process arrays. TinyCommand handles Together AI's rate limits automatically so you don't have to throttle manually.

How do I call an open-source LLM through Together AI?

The fields this action accepts.

Apps that pair well as the trigger for Chat Completion.

Questions about Chat Completion.

Other Together AI actions.