AutomationTasks

Scrape

Extract content from a web page.

Task type: Scrape

Use Scrape to extract raw content from a known web page URL for later analysis.

Run behavior

  • Side effects: This task is read-only or computation-only by default. It returns data for later steps rather than directly writing records or sending messages.
  • Credit usage: This task is not classified as credit-billed by the workflow task registry.

When to use this task

  • Fetch a known page before summarizing it with AI Prompt.
  • Extract public page content from a URL stored on a record.
  • Collect source material for research or content generation workflows.

Setup tips

  • Use known URLs, not search queries.
  • Follow with AI Prompt when you need structured extraction.
  • Store or pass only the parts of the scrape that downstream steps need.

Examples

Summarize a pricing page

Scrape a known competitor pricing page, then use AI Prompt to summarize packaging and positioning.

Outputs

  • Scraped page content and related metadata.

Common pitfalls

  • Using Scrape to discover URLs instead of Search.
  • Scraping pages that require authentication.
  • Passing very large scraped content directly into every downstream prompt.

Reference details

These details are generated from the workflow task registry and match the fields available in the builder.

Required fields

  • url

Variable-aware fields

Fields that accept variable references ({{ inputs.x }}, {{ steps.N.y }}) from the workflow's variable graph:

  • url

Liquid template fields

Fields that are rendered through Liquid (variables plus filters and control flow):

  • url

Builder guidance

When the Workflow Builder Genie configures this task, it follows this guidance:

Use only for known URLs; follow with AI Prompt for extraction if needed.

Default configuration

New instances of this task start with the following configuration:

{
  "name": "Scrape",
  "outputKey": "scrape",
  "taskType": "Scrape",
  "description": "",
  "url": ""
}

On this page