You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: scrapegraph-js/src/crawl.js
+6Lines changed: 6 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -16,6 +16,7 @@ import { getMockResponse } from './utils/mockResponse.js';
16
16
* @param {boolean} [options.extractionMode=true] - true for AI extraction, false for markdown conversion (NO AI/LLM)
17
17
* @param {boolean} [options.cacheWebsite=true] - Whether to cache the website content
18
18
* @param {number} [options.depth=2] - Maximum depth of the crawl (1-10)
19
+
* @param {number|null} [options.breadth] - Maximum number of links to crawl per depth level. If null/undefined, unlimited (default). Controls the 'width' of exploration at each depth. Useful for limiting crawl scope on large sites. Note: maxPages always takes priority. Ignored when sitemap=true.
19
20
* @param {number} [options.maxPages=2] - Maximum number of pages to crawl (1-100)
20
21
* @param {boolean} [options.sameDomainOnly=true] - Whether to only crawl pages from the same domain
21
22
* @param {boolean} [options.sitemap] - Whether to use sitemap for better page discovery
0 commit comments