Action · Fireworks AI · Updated June 2026

How do I call Fireworks for LLM inference?

Short answer: You can run a chat completion in Fireworks AI by hand from its own interface, but it won’t repeat itself. On TinyCommand, add the Fireworks Chat Completion action to a workflow, map its 7 inputs from any upstream app, and it runs automatically every time the trigger fires. No code, and a free tier to start.

Fireworks Chat Completion in Fireworks AI — start free
Inputs

The fields this action accepts.

Every field can be mapped from an upstream trigger, AI step, table row, or hard-coded literal.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| Model (model) | options | Required | Which model to use |
| User Message (message) | string | Required | User message to send to the model |
| System Prompt (system_prompt) | string | Optional | Optional system instructions that shape the model's behavior |
| Temperature (temperature) | string | Optional | Sampling temperature (0–2). Higher = more random. |
| Max Tokens (max_tokens) | string | Optional | Maximum tokens to generate in the response |
| Top P (top_p) | string | Optional | Nucleus sampling threshold (0–1) |
| Response Format (response_format) | options | Optional | Force JSON output (model must support JSON mode) |
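Because Temperature, Max Tokens, and Top P are typed as strings, it can help to check them before they reach the model. The following is a minimal sketch of that check, assuming the documented ranges (0–2 and 0–1) and the two required fields; `coerce_inputs` is a hypothetical helper for illustration, not part of TinyCommand or the Fireworks AI API.

```python
from typing import Any, Dict, Optional


def coerce_inputs(raw: Dict[str, Optional[str]]) -> Dict[str, Any]:
    """Validate the action's string inputs and convert the numeric ones.

    Hypothetical helper: the field keys come from the inputs table,
    everything else is an illustrative assumption.
    """
    # Model and User Message are the only required inputs.
    for name in ("model", "message"):
        if not raw.get(name):
            raise ValueError(f"missing required input: {name}")

    clean: Dict[str, Any] = {"model": raw["model"], "message": raw["message"]}
    if raw.get("system_prompt"):
        clean["system_prompt"] = raw["system_prompt"]

    # Temperature and Top P arrive as strings; parse and range-check them.
    for name, low, high in (("temperature", 0.0, 2.0), ("top_p", 0.0, 1.0)):
        if raw.get(name) not in (None, ""):
            value = float(raw[name])
            if not low <= value <= high:
                raise ValueError(f"{name} must be between {low} and {high}")
            clean[name] = value

    # Max Tokens must parse as a positive integer.
    if raw.get("max_tokens") not in (None, ""):
        tokens = int(raw["max_tokens"])
        if tokens <= 0:
            raise ValueError("max_tokens must be a positive integer")
        clean["max_tokens"] = tokens
    return clean
```

A check like this fails early with a readable message instead of surfacing as a provider error later in the run.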
Sample request
{
  "model": "{{trigger.model}}",
  "message": "e.g. Summarize this article in 3 bullets",
  "system_prompt": "e.g. You are a helpful assistant.",
  "temperature": "0.7",
  "max_tokens": "1024"
}
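The action's flat inputs differ from the OpenAI-compatible request shape, which expects a `messages` array. As a rough sketch of how the two could line up, the function below converts the flat inputs into such a body. The mapping itself is an assumption (the page does not document how TinyCommand builds the request), `to_chat_request` is a hypothetical name, and the `"json_object"` option value is illustrative only.

```python
from typing import Any, Dict, List


def to_chat_request(inputs: Dict[str, Any]) -> Dict[str, Any]:
    """Map the action's flat inputs to an OpenAI-compatible chat body.

    Assumption: this mirrors the general OpenAI-compatible shape; it is
    not taken from TinyCommand's implementation.
    """
    messages: List[Dict[str, str]] = []
    # The optional system prompt becomes the first message.
    if inputs.get("system_prompt"):
        messages.append({"role": "system", "content": inputs["system_prompt"]})
    messages.append({"role": "user", "content": inputs["message"]})

    body: Dict[str, Any] = {"model": inputs["model"], "messages": messages}

    # Numeric options are strings in the action, numbers in the request.
    for name, cast in (("temperature", float), ("max_tokens", int), ("top_p", float)):
        if inputs.get(name) not in (None, ""):
            body[name] = cast(inputs[name])

    # Hypothetical option value; the action's real option names are not documented here.
    if inputs.get("response_format") == "json_object":
        body["response_format"] = {"type": "json_object"}
    return body
```

Optional inputs that are left empty are simply omitted from the body, so the provider's defaults apply.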
Returns
{
  "id": "chatcmpl-abc123",
  "model": "accounts/fireworks/models/llama-v3p3-70b-instruct",
  "usage": {
    "total_tokens": 60,
    "prompt_tokens": 10,
    "completion_tokens": 50
  },
  "choices": [
    {
      "message": {
        "role": "assistant",
        "content": "Sample response"
      },
      "finish_reason": "stop"
    }
  ]
}

Use these fields in downstream nodes for routing, logging, or error handling.
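For routing or logging, a downstream step usually needs only a few of these fields. Here is a small sketch that pulls them out of a response shaped like the sample above; `summarize_completion` is a hypothetical helper, and treating a `finish_reason` of `"length"` as a truncated answer follows the OpenAI-compatible convention rather than anything stated on this page.

```python
from typing import Any, Dict


def summarize_completion(response: Dict[str, Any]) -> Dict[str, Any]:
    """Extract the fields a downstream node is most likely to branch on."""
    choice = response["choices"][0]
    usage = response.get("usage", {})
    return {
        "text": choice["message"]["content"],
        "finish_reason": choice["finish_reason"],
        # "length" means generation stopped at the max_tokens limit.
        "truncated": choice["finish_reason"] == "length",
        "total_tokens": usage.get("total_tokens"),
    }


sample = {
    "id": "chatcmpl-abc123",
    "model": "accounts/fireworks/models/llama-v3p3-70b-instruct",
    "usage": {"total_tokens": 60, "prompt_tokens": 10, "completion_tokens": 50},
    "choices": [
        {
            "message": {"role": "assistant", "content": "Sample response"},
            "finish_reason": "stop",
        }
    ],
}
summary = summarize_completion(sample)
```

With the sample payload, `summary["text"]` is `"Sample response"` and `summary["truncated"]` is `False`, so a branch node could route on either value.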

Triggered by

Apps that pair well as the trigger for Fireworks Chat Completion.

Any of these apps can fire this action as part of a workflow.

FAQ

Questions about Fireworks Chat Completion.

What does the Fireworks Chat Completion action do in Fireworks AI?
Runs chat completion against Fireworks-hosted open-source models. Notable for Firefunction V2 — Fireworks's fine-tuned Llama for reliable function calling. OpenAI-compatible shape.
What inputs does Fireworks Chat Completion require?
Required: Model, User Message. Every input accepts a static value or a variable from any upstream node in your workflow.
Can I use dynamic inputs from earlier workflow nodes?
Yes. Any field on this action can pull values from upstream nodes, whether that's a form response, a trigger payload, an AI output, or a lookup result.
What happens if Fireworks AI returns an error?
The workflow pauses on the failed node, the error message is captured in the run log, and you can retry the run with one click. Auto-retry policies are configurable per workflow with exponential backoff up to 5 attempts.
Does Fireworks Chat Completion support batch operations?
Yes. Run Fireworks Chat Completion inside a Loop node to process arrays. TinyCommand handles Fireworks AI's rate limits automatically so you don't have to throttle manually.
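The retry and batch answers above describe exponential backoff (up to 5 attempts) and looping over an array. As a rough illustration of that idea only, the sketch below retries a callable with doubling delays and applies it to each item in a list. The function names, the 1-second base delay, and the injectable `sleep` parameter are assumptions; this is not TinyCommand's implementation.

```python
import time
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def call_with_backoff(
    fn: Callable[[], R],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> R:
    """Call fn, retrying on any exception with exponentially growing delays."""
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except Exception:
            # Out of attempts: surface the last error to the caller.
            if attempt == max_attempts:
                raise
            # Delays grow as base_delay * 1, 2, 4, 8, ...
            sleep(base_delay * 2 ** (attempt - 1))
    raise RuntimeError("max_attempts must be at least 1")


def run_batch(
    items: Iterable[T],
    action: Callable[[T], R],
    **retry_options,
) -> List[R]:
    """Apply action to each item in order, retrying each call independently."""
    return [
        call_with_backoff(lambda item=item: action(item), **retry_options)
        for item in items
    ]
```

Passing a custom `sleep` makes the delays easy to observe or skip in tests, and each item gets its own retry budget, so one failing element does not consume attempts for the rest.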
More actions

Other Fireworks AI actions.

Action
Fireworks Embeddings
Generates embeddings from Fireworks-hosted models (BGE, GTE, sentence-transformers). For RAG-pipeline vector generation at competitive pricing.
Action
List Fireworks Models
Returns the Fireworks model catalog with pricing. Useful for model-selection workflows and for per-model cost calculations.