Configure the Reddit Scraper settings and filters
Configure your Reddit Scraper squid settings. Control sort order per input type, filter by date, limit results, and choose whether to include posts, comments, or both.Documentation Index
Fetch the complete documentation index at: https://docs.lobstr.io/llms.txt
Use this file to discover all available pages before exploring further.
Token YOUR_API_KEYapplication/jsonparams object:
| Parameter | Type | Default | Description |
|---|---|---|---|
| max_results | integer | unlimited | Maximum number of rows (posts + comments) to collect per task |
| max_unique_results_per_run | integer | unlimited | Maximum unique results across all tasks in the run |
| fetch_since | string | null | Stop collecting content older than this threshold. Use a relative duration (24h, 7d, 2w) or absolute date (YYYY-MM-DD HH:MM) |
| fetch_since_timezone | string | null | Timezone for interpreting an absolute fetch_since date (e.g. Europe/Paris). Ignored for relative values |
| sort_comments | string | best | Sort order for post thread comments: best, top, new, controversial, old, q&a |
| sort_search | string | relevance | Sort order for search results: relevance, hot, top, new, comments |
| sort_subreddit | string | hot | Sort order for subreddit feeds: hot, new, top, rising, controversial |
| include_nsfw | boolean | false | Include NSFW (adult) posts and comments in results |
| skip_comments | boolean | false | Return only post rows — skip all comments |
| skip_posts | boolean | false | Skip post rows. For post URLs: returns comments only. For user URLs: returns profile and comments. For subreddit URLs: returns subreddit metadata only |
| Setting | Type | Description |
|---|---|---|
| name | string | Display name for your squid configuration |
| concurrency | integer | Number of parallel scraping threads (default: 1) |
| export_unique_results | boolean | Export only unique results (deduplicated) |
| to_complete | boolean | Run until all tasks complete |
| no_line_breaks | boolean | Remove line breaks from results |
fetch_since filters based on the created_at field of each post or comment. It is useful for incremental runs to avoid re-collecting old content.