Action · Nvidia NIM · Updated June 2026

How do I call Nvidia NIM for LLM inference?

Short answer: You can run a chat completion in Nvidia NIM by hand from its own interface, but it won’t repeat itself. On TinyCommand, add the Nvidia NIM Chat Completion action to a workflow, map its 5 inputs from any upstream app, and it runs automatically every time the trigger fires. No code, and a free tier to start.

Nvidia NIM Chat Completion in Nvidia NIM — start free
Inputs

The fields this action accepts.

Every field can be mapped from an upstream trigger, AI step, table row, or hard-coded literal.

Field                          Type     Required  Description
Model (model)                  options  Required  Which model to use
User Message (message)         string   Required  Message sent to the model as user role
System Prompt (system_prompt)  string   Optional  Optional system instructions
Temperature (temperature)      string   Optional  0-2, higher = more random
Max Tokens (max_tokens)        string   Optional  Maximum tokens to generate

Sample request
{
  "model": "{{trigger.model}}",
  "message": "Your prompt",
  "system_prompt": "e.g. You are a helpful assistant",
  "temperature": "0.7",
  "max_tokens": "1024"
}
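
For readers who want to see what these five inputs amount to, here is a minimal Python sketch that validates them and assembles an OpenAI-style chat body. The mapping of system_prompt and message onto a messages array is an assumption based on the "OpenAI-compatible shape" noted in the FAQ, and build_chat_request is a hypothetical helper, not part of TinyCommand or Nvidia NIM.

```python
from typing import Any


def build_chat_request(inputs: dict[str, str]) -> dict[str, Any]:
    """Turn the action's five inputs into an OpenAI-style chat body (assumed mapping)."""
    # Model and User Message are the two required fields.
    for key in ("model", "message"):
        if not inputs.get(key):
            raise ValueError(f"missing required input: {key}")

    messages = []
    # The system prompt is optional; include it only when provided.
    if inputs.get("system_prompt"):
        messages.append({"role": "system", "content": inputs["system_prompt"]})
    messages.append({"role": "user", "content": inputs["message"]})

    body: dict[str, Any] = {"model": inputs["model"], "messages": messages}

    # Temperature and Max Tokens arrive as strings, so convert and check them.
    if inputs.get("temperature"):
        temperature = float(inputs["temperature"])
        if not 0.0 <= temperature <= 2.0:
            raise ValueError("temperature must be between 0 and 2")
        body["temperature"] = temperature
    if inputs.get("max_tokens"):
        max_tokens = int(inputs["max_tokens"])
        if max_tokens <= 0:
            raise ValueError("max_tokens must be a positive integer")
        body["max_tokens"] = max_tokens
    return body


example = build_chat_request({
    "model": "meta/llama-3.3-70b-instruct",
    "message": "Your prompt",
    "system_prompt": "You are a helpful assistant",
    "temperature": "0.7",
    "max_tokens": "1024",
})
print(example)
```

Checking the range and types before the call means a badly mapped upstream value fails with a clear message instead of an opaque provider error.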
Returns
{
  "id": "chatcmpl_abc",
  "model": "meta/llama-3.3-70b-instruct",
  "usage": {
    "total_tokens": 60,
    "prompt_tokens": 10,
    "completion_tokens": 50
  },
  "choices": [
    {
      "message": {
        "role": "assistant",
        "content": "Sample"
      },
      "finish_reason": "stop"
    }
  ]
}

Use these fields in downstream nodes for routing, logging, or error handling.
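
As an illustration of what a downstream node might pull out of this response, the Python sketch below reads the reply text, the finish reason, and the token count from the sample shown above. summarize_completion is a hypothetical helper, and treating a finish_reason of "length" as a truncated reply is an assumption borrowed from the OpenAI convention, not something this page confirms for Nvidia NIM.

```python
from typing import Any


def summarize_completion(response: dict[str, Any]) -> dict[str, Any]:
    """Extract the fields most useful for routing and logging."""
    choice = response["choices"][0]
    usage = response.get("usage", {})
    finish_reason = choice.get("finish_reason")
    return {
        "text": choice["message"]["content"],
        "finish_reason": finish_reason,
        # Assumption: "length" means the reply hit the max_tokens cap.
        "truncated": finish_reason == "length",
        "total_tokens": usage.get("total_tokens"),
    }


# The sample response from the Returns section.
sample = {
    "id": "chatcmpl_abc",
    "model": "meta/llama-3.3-70b-instruct",
    "usage": {"total_tokens": 60, "prompt_tokens": 10, "completion_tokens": 50},
    "choices": [
        {
            "message": {"role": "assistant", "content": "Sample"},
            "finish_reason": "stop",
        }
    ],
}

summary = summarize_completion(sample)
print(summary["text"], summary["finish_reason"], summary["total_tokens"])
```

A workflow could branch on the truncated flag, for example by re-running the step with a larger Max Tokens value.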

Triggered by

Apps that pair well as the trigger for Nvidia NIM Chat Completion.

Any of these apps can fire this action as part of a workflow.

FAQ

Questions about Nvidia NIM Chat Completion.

What does the Nvidia NIM Chat Completion action do in Nvidia NIM?
Runs a chat completion against Nvidia-hosted, optimised inference for Llama, Mixtral, Nemotron, and other open-source models. Requests and responses follow the OpenAI-compatible shape. It suits enterprise workflows that want Nvidia's GPU-optimised inference.
What inputs does Nvidia NIM Chat Completion require?
Required: Model, User Message. Every input accepts a static value or a variable from any upstream node in your workflow.
Can I use dynamic inputs from earlier workflow nodes?
Yes. Any field on this action can pull values from upstream nodes, whether that's a form response, a trigger payload, an AI output, or a lookup result.
What happens if Nvidia NIM returns an error?
The workflow pauses on the failed node, the error message is captured in the run log, and you can retry the run with one click. Auto-retry policies are configurable per workflow with exponential backoff up to 5 attempts.
Does Nvidia NIM Chat Completion support batch operations?
Yes. Run Nvidia NIM Chat Completion inside a Loop node to process arrays. TinyCommand handles Nvidia NIM's rate limits automatically so you don't have to throttle manually.
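
To make the retry behaviour mentioned in the FAQ concrete, here is a rough Python sketch of exponential backoff capped at 5 attempts. It is not TinyCommand's implementation: the 1-second base delay, the doubling factor, and the reading of "up to 5 attempts" as 5 total tries are all assumptions, and backoff_delays and call_with_retry are hypothetical names.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def backoff_delays(max_attempts: int = 5, base_seconds: float = 1.0,
                   factor: float = 2.0) -> list[float]:
    """Waits between attempts; the first attempt runs immediately."""
    return [base_seconds * factor ** i for i in range(max_attempts - 1)]


def call_with_retry(action: Callable[[], T], max_attempts: int = 5,
                    base_seconds: float = 1.0,
                    sleep: Callable[[float], None] = time.sleep) -> T:
    """Run action, retrying failures with exponentially growing waits."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    delays = backoff_delays(max_attempts, base_seconds)
    for attempt in range(max_attempts):
        try:
            return action()
        except Exception:
            # Give up and re-raise once the final attempt has failed.
            if attempt == max_attempts - 1:
                raise
            sleep(delays[attempt])
    raise RuntimeError("unreachable")


print(backoff_delays())
```

Passing the sleep function as a parameter keeps the sketch easy to exercise without real waiting.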
More actions

Other Nvidia NIM actions.

Action
Nvidia NIM Embeddings
Generates embeddings using NIM-hosted models. For RAG-pipeline vector generation in enterprise workflows.
Action
List Nvidia NIM Models
Returns the NIM model catalog. Useful for model selection and capacity planning.