
Universal Content Scraper

Finance & Investment · Contacts · E-commerce · Academic Literature · Other
Turn any URL into clean, structured data for AI models instantly.
Access Level: Free
Last Updated: 2026/03/16

Try it!

🚀 Why Use the Universal Content Scraper?

Turn any web page into AI-ready training data instantly.

Designed for the era of Large Language Models (LLMs) and RAG (Retrieval-Augmented Generation) systems, the Universal Content Scraper is built to extract clean, structured main body content from virtually any article, blog post, or documentation page.

Unlike traditional scrapers that require custom rules for every website, this intelligent template automatically identifies the "main content" of a page, stripping away noise like navigation bars, ads, and footers. It outputs data in structured formats (Markdown/JSON) perfect for feeding into vector databases, GPTs, or Claude.
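To illustrate the "feeding into vector databases" step, here is a minimal, hedged sketch of how the extracted body text might be split into overlapping chunks before embedding. The function name, chunk size, and overlap are illustrative choices, not part of the template itself.

```python
def chunk(text, size=800, overlap=100):
    """Split article text into overlapping chunks for embedding/RAG ingestion."""
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap  # step forward, keeping `overlap` chars of context
    return chunks

# Example: a 2,500-character article body yields four chunks.
parts = chunk("word " * 500)
print(len(parts))  # 4
```

The overlap keeps sentence context that straddles a chunk boundary available to both chunks, which generally improves retrieval quality.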

🌟 Key Features

  • Universal Compatibility: Works on news sites, blogs, documentation, and knowledge bases.
  • AI-Native Output: Extracts content in clean formats suitable for model context windows.
  • Smart Cleaning: Automatically removes clutter to focus on the core text.
  • Batch Processing: Input a list of URLs and scrape them all in one run.

Data Preview

The template extracts the following standardized fields for every URL:

  • url: The source URL of the page.
  • title: The extracted title of the article or page.
  • content: The main body text, cleaned and structured (supports Markdown/JSON format).
  • author: The author of the content (if detectable).
  • published_at: The publication date (e.g., 2026-01-29).
  • format: The output format tag (e.g., json, markdown).
  • error_message: Captures any access errors (e.g., 403 Forbidden) for easier debugging.

📂 Sample Data (JSON Representation)

{
  "url": "https://www.bloomberg.com/opinion/articles/...",
  "title": "Why Is Germany Sitting on $599 Billion of Gold?",
  "content": "{\"text\": \"Eighty feet below the streets of Manhattan...\"}",
  "author": "Chris Bryant",
  "published_at": "2026-01-29",
  "format": "json"
}
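Note that in the sample above the content field is itself a JSON string. A minimal sketch of unwrapping it in Python, using the sample record as-is:

```python
import json

# A record as exported by the scraper; "content" holds an escaped JSON string.
record = {
    "url": "https://www.bloomberg.com/opinion/articles/...",
    "title": "Why Is Germany Sitting on $599 Billion of Gold?",
    "content": "{\"text\": \"Eighty feet below the streets of Manhattan...\"}",
    "author": "Chris Bryant",
    "published_at": "2026-01-29",
    "format": "json",
}

# Parse the inner JSON to reach the actual article text.
body = json.loads(record["content"])
print(body["text"])  # Eighty feet below the streets of Manhattan...
```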

🛠 How to Use: Step-by-Step Guide

1. Start the Template

Click "Try it!"

2. Enter Your Parameters

Provide the target links.

  • Target URLs: Copy and paste the list of URLs you want to scrape (e.g., a list of blog post links, news article URLs).

3. Run the Scraper

  • Click Start.
  • Choose Run in Cloud.
  • Octoparse will visit each URL, intelligently detect the article body, and save the data.

4. Export Your Data

  • Once finished, export directly to JSON, CSV, or Excel.
  • Tip: Use the JSON export if you plan to feed this data directly into an API or Python script.
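As a starting point for that Python script, here is a hedged sketch that loads a JSON export and drops failed rows. It assumes the export is a JSON array of records with the fields listed above; the function name and file path are illustrative.

```python
import json

def load_clean_records(path):
    """Load a JSON export and keep rows that scraped successfully."""
    with open(path, encoding="utf-8") as f:
        records = json.load(f)
    # Drop rows that recorded an error or came back with no body text.
    return [r for r in records if not r.get("error_message") and r.get("content")]
```

Usage: `docs = load_clean_records("export.json")` (substitute your actual export filename).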

⚠️ Important Notes & Best Practices

🌐 Handling Anti-Scraping (403 Errors)

Since this template visits various websites, some high-security sites may block standard requests.

  • Solution: If you see "403 Forbidden" in the error_message column, enable Octoparse Premium Proxies in the task settings or use the Cloud Extraction mode to rotate IPs automatically.
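To build a retry list before re-running with proxies enabled, you could filter the export on the error_message column. A small sketch with illustrative sample rows (the URLs and error strings are made up):

```python
# Sample rows as the scraper might export them; values are illustrative.
records = [
    {"url": "https://example.com/a", "error_message": ""},
    {"url": "https://example.com/b", "error_message": "403 Forbidden"},
]

# Collect the blocked URLs so they can be re-run with premium proxies.
retry = [r["url"] for r in records if "403" in (r.get("error_message") or "")]
print(retry)  # ['https://example.com/b']
```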

📑 Content Structure

The scraper is optimized for "Article-like" pages (blogs, news, docs).

  • It may not perform as well on complex dynamic dashboards or social media feeds (like Twitter/X timelines), which require specialized templates.

⏱️ Dynamic Loading

The template includes basic scroll handling to capture content that loads as the page scrolls.


❓ FAQs

Q: Can I scrape behind a login?

A: This template is designed for public pages. For pages requiring a login, you would need to configure cookie sharing in a custom task, though this template works best for publicly accessible information.

Q: Why is the 'content' field in JSON format inside the CSV?

A: To preserve the structure (paragraphs, headers) within a single spreadsheet cell, the content is often wrapped as a JSON object or a Markdown string. This ensures that when you process the data programmatically, you retain the original formatting.
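A minimal sketch of unwrapping such a cell from a CSV export, using Python's standard csv and json modules (the one-row CSV here is illustrative of the export shape):

```python
import csv
import io
import json

# A one-row CSV like the export; the content cell holds an escaped JSON object.
csv_text = 'url,content\nhttps://example.com,"{""text"": ""Hello world""}"\n'

texts = []
for row in csv.DictReader(io.StringIO(csv_text)):
    # The CSV reader undoes the quote-doubling; json.loads unwraps the rest.
    texts.append(json.loads(row["content"])["text"])
print(texts)  # ['Hello world']
```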

Q: How many URLs can I scrape at once?

A: You can input thousands of URLs. For tasks larger than 10,000 URLs, we recommend splitting them into batches or using Cloud Extraction to speed up the process.
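Splitting a large URL list into batches of at most 10,000 can be done with a one-line slice loop; a sketch (the batch size mirrors the recommendation above):

```python
def batches(urls, size=10_000):
    """Split a URL list into chunks no larger than `size` for separate runs."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]

# Example: 25,000 URLs become two full batches and one remainder.
chunks = batches([f"https://example.com/{i}" for i in range(25_000)])
print([len(c) for c in chunks])  # [10000, 10000, 5000]
```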