- Extract Structured Data
Action | Firecrawl | Updated May 2026
How do I extract structured data from a page?
Short answer: Drop the "Firecrawl → Extract Structured Data" action anywhere in your workflow, map the inputs from upstream nodes, and publish.
Inputs
The fields this action accepts.
Every field can be mapped from an upstream trigger, AI step, table row, or hard-coded literal.
| Field | Type | Required | Description |
|---|---|---|---|
| URL (url) | string | Required | Fully-qualified URL of the page to extract from. |
| Extraction Prompt (prompt) | string | Optional | Natural-language description of what to extract. Either Prompt or Schema (or both) must be set: Prompt alone lets the model pick the shape, Schema alone gives strict structure, and using both together is most reliable. |
| Output Schema (schema) | json_schema | Optional | JSON Schema describing the desired output shape. Define properties and types row by row. Either Schema or Prompt (or both) must be set. |
| Only Main Content (only_main_content) | boolean | Optional | Strip navigation, footer, and sidebar boilerplate before extraction. Almost always true: cleaner extraction, fewer tokens, lower cost. |
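The rules in the table can be checked before the inputs ever reach the action. The following is a minimal sketch, not Tiny Command's or Firecrawl's own validation: the helper `build_extract_inputs` and the example `story_schema` are hypothetical names introduced for illustration, and the default of `True` for `only_main_content` is an assumption based on the "almost always true" note above.

```python
import json
from urllib.parse import urlparse


def build_extract_inputs(url, prompt=None, schema=None, only_main_content=True):
    """Assemble the action's input fields (hypothetical helper, not a real API)."""
    parsed = urlparse(url)
    # "Fully-qualified" is interpreted here as: http(s) scheme plus a host.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"url must be fully qualified, got {url!r}")
    # The table requires at least one of prompt or schema.
    if prompt is None and schema is None:
        raise ValueError("set prompt, schema, or both")
    inputs = {"url": url, "only_main_content": only_main_content}
    if prompt is not None:
        inputs["prompt"] = prompt
    if schema is not None:
        inputs["schema"] = schema
    return inputs


# Example JSON Schema (illustrative): one string property and one numeric property.
story_schema = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "points": {"type": "number"},
    },
    "required": ["title"],
}

inputs = build_extract_inputs(
    "https://news.ycombinator.com",
    prompt="Extract the top story's title and points.",
    schema=story_schema,
)
print(json.dumps(inputs, indent=2))
```

Passing both a prompt and a schema, as here, follows the table's note that the combination is the most reliable option.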
Sample request
{
  "url": "e.g. https://news.ycombinator.com",
  "prompt": "e.g. Extract the product name, price (number), and short description.",
  "schema": "{{trigger.schema}}",
  "only_main_content": "{{trigger.only_main_content}}"
}
Returns
{
  "data": {
    "json": {
      "title": "I Moved My Digital Stack to Europe",
      "points": 271
    },
    "metadata": {
      "sourceURL": "https://news.ycombinator.com"
    }
  },
  "success": true
}
Use these fields in downstream nodes for routing, logging, or error handling.
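As an illustration of downstream use, the sketch below parses the sample response shown under Returns and derives a routing decision from it. The function `read_extract_result` and the 100-point threshold are assumptions made for the example; inside a workflow, the same fields would normally be mapped in the node editor rather than in code.

```python
import json

# The sample response from the Returns section, as a JSON string.
raw_response = """
{"data": {"json": {"title": "I Moved My Digital Stack to Europe", "points": 271},
          "metadata": {"sourceURL": "https://news.ycombinator.com"}},
 "success": true}
"""


def read_extract_result(response):
    """Return (extracted_fields, source_url), or raise if success is false."""
    if not response.get("success"):
        raise RuntimeError("extraction did not succeed")
    data = response.get("data", {})
    return data.get("json", {}), data.get("metadata", {}).get("sourceURL")


fields, source_url = read_extract_result(json.loads(raw_response))

# Hypothetical routing rule: treat stories with 100+ points as "popular".
route = "popular" if fields.get("points", 0) >= 100 else "normal"
print(route, fields["title"], source_url)
```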
Triggered by
Apps that pair well as the trigger for Extract Structured Data.
Any of these apps can fire this action as part of a workflow.
FAQ
Questions about Extract Structured Data.
What does the Extract Structured Data action do in Firecrawl?
Pass a URL and a schema (or a prompt), and Firecrawl extracts the matching fields. It suits workflows such as "scrape product details from this page into our DB", where you want typed JSON rather than raw HTML.
What inputs does Extract Structured Data require?
Required: URL. In addition, at least one of Extraction Prompt or Output Schema must be set. Every input accepts a static value or a variable from any upstream node in your workflow.
Can I use dynamic inputs from earlier workflow nodes?
Yes. Any field on this action can pull values from upstream nodes, whether that's a form response, a trigger payload, an AI output, or a lookup result.
What happens if Firecrawl returns an error?
The workflow pauses on the failed node, the error message is captured in the run log, and you can retry the run with one click. Auto-retry policies are configurable per workflow with exponential backoff up to 5 attempts.
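To make "exponential backoff up to 5 attempts" concrete, here is a rough sketch of that retry pattern. It is not Tiny Command's implementation: the 1-second base delay, the doubling factor, and the names `backoff_delays` and `run_with_retry` are all assumptions chosen for illustration.

```python
import time


def backoff_delays(max_attempts=5, base_delay=1.0, factor=2.0):
    """Waits between attempts; there is no wait before the first attempt."""
    return [base_delay * factor ** i for i in range(max_attempts - 1)]


def run_with_retry(step, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call step() until it succeeds or max_attempts is reached."""
    delays = backoff_delays(max_attempts, base_delay)
    for attempt in range(1, max_attempts + 1):
        try:
            return step()
        except Exception:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            sleep(delays[attempt - 1])


# Demo: a step that fails twice, then succeeds. Waits are recorded, not slept.
calls = {"count": 0}


def flaky_step():
    calls["count"] += 1
    if calls["count"] < 3:
        raise RuntimeError("temporary failure")
    return "ok"


waits = []
result = run_with_retry(flaky_step, sleep=waits.append)
print(result, waits)
```

With these assumed settings, a run that fails on every attempt would wait 1, 2, 4, and 8 seconds between the five attempts before giving up.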
Does Extract Structured Data support batch operations?
Yes. Run Extract Structured Data inside a Loop node to process arrays. Tiny Command handles Firecrawl's rate limits automatically so you don't have to throttle manually.
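Conceptually, a Loop node around this action behaves like the sketch below: one extraction per array item, with failures recorded per item instead of stopping the batch. Both `extract_each` and `fake_extract` are hypothetical stand-ins; whether a real Loop node continues after a failed item depends on how the workflow is configured.

```python
def extract_each(urls, extract):
    """Call extract(url) for every URL, collecting successes and failures."""
    results = []
    for url in urls:
        try:
            results.append({"url": url, "ok": True, "data": extract(url)})
        except Exception as exc:
            results.append({"url": url, "ok": False, "error": str(exc)})
    return results


def fake_extract(url):
    # Stand-in for the action; a real run would return the extracted JSON.
    if "broken" in url:
        raise RuntimeError("extraction failed")
    return {"title": f"Page at {url}"}


results = extract_each(
    [
        "https://example.com/a",
        "https://example.com/broken",
        "https://example.com/b",
    ],
    fake_extract,
)
succeeded = [r["url"] for r in results if r["ok"]]
failed = [r["url"] for r in results if not r["ok"]]
print(succeeded, failed)
```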
More actions
Other Firecrawl actions.
Action: Crawl Site
Starting from a URL, Firecrawl follows links and fetches every page within scope. Returns markdown-formatted content of each. For "ingest a docs site into a RAG index" or "build a knowledge base from a brand's blog" workflows.
Action: Get Firecrawl Agent Result
Polls an agent task for completion and returns the result. For Firecrawl's agentic-scraping workflows that perform multi-step browse tasks.
Action: Get Crawl Status
Returns the current status of a running crawl: pages crawled, pages discovered, completion percentage. Poll until status=completed before consuming results.
Action: Map Site URLs
Returns the URL list of a site without fetching content. It is the fast preflight before deciding what to crawl. Useful for "find all pages under /blog" or "count pages on this competitor's site" inventory workflows.
Action: Run Firecrawl Agent
Runs an agentic browse task: Firecrawl's agent navigates the site and performs the configured extraction goal. For complex scraping that requires sequential page navigation or form interaction.
Action: Scrape URL
Fetches and returns cleaned markdown for a single URL. Handles JavaScript rendering and bot evasion. The right tool for "extract content from this one URL for LLM consumption" workflows.
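The "poll until status=completed" advice for Get Crawl Status can be sketched as a simple polling loop. This is an assumption-laden illustration: `wait_for_crawl` is a hypothetical helper, `get_status` stands in for whatever returns the status payload, and the "running" status value, the 2-second interval, and the 30-poll cap are invented for the example. Only `status` and `completed` come from the description above.

```python
import time


def wait_for_crawl(get_status, poll_interval=2.0, max_polls=30, sleep=time.sleep):
    """Poll get_status() until its status is "completed" or polls run out."""
    for _ in range(max_polls):
        snapshot = get_status()
        if snapshot["status"] == "completed":
            return snapshot
        sleep(poll_interval)
    raise TimeoutError(f"crawl not completed after {max_polls} polls")


# Demo with canned snapshots instead of a live crawl; sleeping is skipped.
snapshots = iter([
    {"status": "running"},
    {"status": "running"},
    {"status": "completed"},
])
final = wait_for_crawl(lambda: next(snapshots), sleep=lambda seconds: None)
print(final["status"])
```

Capping the number of polls keeps a stalled crawl from blocking the workflow indefinitely.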
Extract structured data from your workflows.
Triggered by anything in the catalog. Free tier available. No credit card.