Overview
Add Scrape.do to any ModuleX agent or workflow. Enterprise web-scraping API integration: basic HTTP, JS-rendered browser, screenshots, markdown conversion, and credit-usage stats. All five actions hitapi.scrape.do (or /info for usage).
Categories: Web Search & Scraping · Data Extraction · Auth: API Key · Actions: 5
Authentication
API Key Authentication
Authenticate using your Scrape.do API keyRequired Credentials
| Field | Description | Required | Format |
|---|---|---|---|
| Scrape.do API Key | Your Scrape.do API key for authentication | Yes | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx |
Available Actions
scrape — Perform basic web scraping without JavaScript rendering. Ideal for static websites and APIs.
scrape — Perform basic web scraping without JavaScript rendering. Ideal for static websites and APIs.
Parameters
URL to scrape
HTTP method (GET, POST, PUT, DELETE, HEAD) (Default:
GET)Request body for POST/PUT requests
Use residential & mobile proxy network
Country code for proxy location (e.g. ‘us’, ‘uk’, ‘de’)
Regional proxy location: ‘europe’, ‘asia’, ‘africa’, ‘oceania’, ‘northamerica’, ‘southamerica’
Sticky session ID (0-1000000) for IP persistence
Device emulation (‘desktop’, ‘mobile’, ‘tablet’)
Request timeout in ms (5000-120000)
Retry timeout in ms (5000-55000)
Disable automatic retry on failure
Disable following redirects
Let Scrape.do add default headers
Forward extra upstream headers
Forward client headers to target
Cookies to send (JSON string or header)
Block images, CSS, fonts to speed up loading
Block advertisements
Output format (‘raw’ or ‘markdown’)
Return the origin response body with no parsing
Response
scrape_with_js — Scrape JavaScript-rendered pages using headless browser. Essential for SPAs and dynamic content.
scrape_with_js — Scrape JavaScript-rendered pages using headless browser. Essential for SPAs and dynamic content.
Parameters
URL to scrape
HTTP method (GET, POST, PUT, DELETE, HEAD) (Default:
GET)Request body for POST/PUT requests
Use residential & mobile proxy network
Country code for proxy location (e.g. ‘us’, ‘uk’, ‘de’)
Regional proxy location: ‘europe’, ‘asia’, ‘africa’, ‘oceania’, ‘northamerica’, ‘southamerica’
Sticky session ID (0-1000000) for IP persistence
Device emulation (‘desktop’, ‘mobile’, ‘tablet’)
Request timeout in ms (5000-120000)
Retry timeout in ms (5000-55000)
Disable automatic retry on failure
Disable following redirects
Let Scrape.do add default headers
Forward extra upstream headers
Forward client headers to target
Cookies to send (JSON string or header)
Block images, CSS, fonts to speed up loading
Block advertisements
Output format (‘raw’ or ‘markdown’)
Wait condition: ‘domcontentloaded’, ‘networkidle0’, ‘networkidle2’, ‘load’
CSS selector to wait for before capturing
Additional wait time in ms
Browser viewport width (Default:
1920)Browser viewport height (Default:
1080)JSON-encoded Play-with-Browser action list
Response
take_screenshot — Capture webpage screenshots: viewport, full-page, or element-specific.
take_screenshot — Capture webpage screenshots: viewport, full-page, or element-specific.
Parameters
URL to capture
Capture full page instead of viewport (Default:
false)CSS selector for element-specific screenshot
Use residential & mobile proxy network
Country code for proxy location (e.g. ‘us’, ‘uk’, ‘de’)
Regional proxy location: ‘europe’, ‘asia’, ‘africa’, ‘oceania’, ‘northamerica’, ‘southamerica’
Sticky session ID (0-1000000) for IP persistence
Device emulation (‘desktop’, ‘mobile’, ‘tablet’)
Request timeout in ms (5000-120000)
Retry timeout in ms (5000-55000)
Disable automatic retry on failure
Disable following redirects
Viewport width (Default:
1920)Viewport height (Default:
1080)Wait condition for render completion
CSS selector to wait for before capturing
Additional wait time in ms
Block advertisements
Let Scrape.do add default headers
Cookies to send
Response
scrape_to_markdown — Scrape web pages and convert content to clean, readable markdown format.
scrape_to_markdown — Scrape web pages and convert content to clean, readable markdown format.
Parameters
URL to scrape
Enable JavaScript rendering (Default:
false)HTTP method (Default:
GET)Request body for POST/PUT
Use residential & mobile proxy network
Country code for proxy location (e.g. ‘us’, ‘uk’, ‘de’)
Regional proxy location: ‘europe’, ‘asia’, ‘africa’, ‘oceania’, ‘northamerica’, ‘southamerica’
Sticky session ID (0-1000000) for IP persistence
Device emulation (‘desktop’, ‘mobile’, ‘tablet’)
Request timeout in ms (5000-120000)
Retry timeout in ms (5000-55000)
Disable automatic retry on failure
Disable following redirects
Block images, CSS, fonts to speed up loading
Block advertisements
Let Scrape.do inject default headers
Cookies to send
JSON-encoded Play-with-Browser script
Response
get_usage_stats — Get API usage statistics and remaining credits for your Scrape.do account.
get_usage_stats — Get API usage statistics and remaining credits for your Scrape.do account.
Response
Limits & Quotas
- Each scrape action exposes 20+ optional knobs (proxy routing,
geo-targeting, device emulation, cookies, headers, wait
conditions, viewport). All map to Scrape.do’s camelCase query
string keys via a single
_PARAM_MAPtranslation table. take_screenshotis mutually-exclusive between viewport / full-page / element modes — the tool validates thatfull_pageandselectoraren’t both set.- Output shape varies per upstream response:
- JSON →
payload: dict - text/html/markdown →
data: strwithis_binary=False - image/* →
data: <base64>withis_binary=True
- JSON →
- 180s timeout for scrape operations (matches legacy); 30s for the usage-stats endpoint.