You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| schema | object | No | - | JSON Schema object for structured output |
59
58
| rules | object | No | - | Crawl rules for filtering URLs. Object with optional fields: `exclude` (array of regex URL patterns), `include_paths` (array of path patterns to include, supports wildcards `*` and `**`), `exclude_paths` (array of path patterns to exclude, takes precedence over `include_paths`), `same_domain` (boolean, default: true). See Rules section below for details. |
60
59
| sitemap | boolean | No | false | Use sitemap.xml for discovery |
61
-
| render_heavy_js | boolean | No | false | Enable heavy JavaScript rendering |
62
60
| stealth | boolean | No | false | Enable stealth mode to bypass bot protection using advanced anti-detection techniques. Adds +4 credits to the request cost |
63
61
| webhook_url | str | No | None | Webhook URL to send the job result to. When provided, a signed webhook notification will be sent upon job completion. See [Webhook Signature Verification](#webhook-signature-verification) below.
Copy file name to clipboardExpand all lines: api-reference/endpoint/smartscraper/start.mdx
-7Lines changed: 0 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -54,12 +54,6 @@ SmartScraper allows you to extract specific information from any webpage using A
54
54
Range: 0-50
55
55
</ParamField>
56
56
57
-
<ParamFieldbody="render_heavy_js"type="boolean">
58
-
Optional parameter to enable enhanced JavaScript rendering for heavy JS websites (React, Vue, Angular, SPAs). Use when standard rendering doesn't capture all content.
59
-
60
-
Default: false
61
-
</ParamField>
62
-
63
57
<ParamFieldbody="mock"type="boolean">
64
58
Optional parameter to enable mock mode. When set to true, the request will return mock data instead of performing an actual extraction. Useful for testing and development.
65
59
@@ -117,7 +111,6 @@ curl -X POST 'https://api.scrapegraphai.com/v1/smartscraper' \
117
111
"user_prompt": "Extract all the headlines from this section into a table with the date and URL of the news",
118
112
"total_pages": 2,
119
113
"stealth": true,
120
-
"render_heavy_js": true,
121
114
"headers": {
122
115
"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36",
2.**Test incrementally** - If the default doesn't capture all content, try increasing in 1000ms increments (4000, 5000, etc.) rather than setting a very high value.
152
152
153
-
3.**Combine with other parameters** - Use `wait_ms` together with `render_heavy_js` for JavaScript-heavy pages:
154
-
155
-
```python
156
-
response = client.smartscraper(
157
-
website_url="https://heavy-js-site.com",
158
-
user_prompt="Extract all products",
159
-
wait_ms=8000,
160
-
render_heavy_js=True
161
-
)
162
-
```
163
-
164
-
4.**Balance speed and completeness** - Higher wait times ensure more content is captured but increase response time and resource usage.
153
+
3.**Balance speed and completeness** - Higher wait times ensure more content is captured but increase response time and resource usage.
165
154
166
155
## Troubleshooting
167
156
168
157
<Accordiontitle="Content still missing after increasing wait_ms"icon="exclamation-triangle">
169
158
If increasing `wait_ms` doesn't capture all content:
170
159
171
-
- Try enabling `render_heavy_js=True` for JavaScript-heavy pages
172
160
- Check if the content requires user interaction (clicks, scrolls) - use `number_of_scrolls` for infinite scroll pages
173
161
- Verify the content isn't behind authentication - use custom headers/cookies
@@ -209,24 +202,6 @@ When `branding=true` is passed, the response includes a `branding` object with b
209
202
Want to learn more about our AI-powered scraping technology? Visit our [main website](https://scrapegraphai.com) to discover how we're revolutionizing web data extraction.
210
203
</Note>
211
204
212
-
## JavaScript Rendering
213
-
214
-
The `render_heavy_js` parameter controls whether JavaScript should be executed on the target page:
0 commit comments