Web Scraper IDE FAQs
Find answers to common questions about Bright Data’s Web Scraper IDE, including setup, troubleshooting, and best practices for developing custom data scrapers.
What is a Bright Data Web Scraper?
Bright Data Web Scrapers are automated tools that let businesses collect all types of public online data at scale, while sharply reducing in-house spending on proxy maintenance and development.
The Web Scraper delivers large volumes of raw data in a structured format and integrates with existing systems, so the data is immediately usable for competitive, data-driven decisions.
Bright Data has developed hundreds of Web Scrapers customized to popular platforms.
What is Web Scraper IDE?
Web Scraper IDE is an integrated development environment that puts public web data at your fingertips, at any scale. With it, you can:
- Build your scraper in minutes
- Debug and diagnose with ease
- Bring it to production quickly
- Script browser interactions in simple JavaScript
What is an “input” when using a Web Scraper?
When collecting data, your “input” is the set of parameters you run your collection with. These can include keywords, URLs, search terms, product IDs, ASINs, profile names, check-in and check-out dates, and so on.
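For illustration, a hypothetical hotel-booking scraper might accept an input record like this (the field names are made up for the example):

```json
{
  "url": "https://example.com/hotels/grand-plaza",
  "check_in": "2025-06-01",
  "check_out": "2025-06-05",
  "keyword": "sea view"
}
```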
What is an “output” when using a Web Scraper?
The output is the data you’ve collected from a platform based on your input parameters. You’ll receive your data as JSON, NDJSON, CSV, or XLSX.
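For example, NDJSON output contains one JSON record per line; a hypothetical two-record result might look like this (field names are illustrative):

```json
{"title": "Grand Plaza Hotel", "price": 189.0, "currency": "USD", "rating": 4.5}
{"title": "Harbor View Inn", "price": 145.0, "currency": "USD", "rating": 4.2}
```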
How many free records are included with my free trial?
Each free trial includes 100 records (note: 100 records does not mean 100 page loads).
Why did I receive more statistic records than inputs?
You’ll always receive a higher number of records than the number of inputs you submitted, because a single input (such as a search term or a category URL) can yield many records.
What are the most frequent data points collected from social media?
Can I collect data from multiple platforms?
Yes, we can collect data from large numbers of websites at the same time.
Can I add additional information to my Web Scraper?
Yes. You can ask your account manager for help, or open a ticket for the specific Web Scraper by selecting ‘Report an issue’ and requesting that fields be added to or removed from your Web Scraper.
What is a search scraper?
In cases where you don’t know a specific URL, you can search for a term and get data based on that term.
What is a discovery scraper?
With a discovery scraper, you enter one or more URLs and collect all the data from those pages. You’ll receive data without having to specify a particular product or keyword.
Can I change the code in the IDE by myself?
Yes. The code is written in JavaScript, and for self-managed scrapers you can change it to fit your requirements.
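As a rough sketch, a self-managed scraper’s interaction and parser code might look like this. The navigate(), wait(), and collect() commands and the cheerio-like $() parser selector are IDE commands, but check the IDE command reference for exact signatures; the selectors here are hypothetical:

```js
// Interaction code (sketch): load the page, wait for content, collect.
navigate(input.url);      // open the target page (a billable page load)
wait('#product-title');   // wait until the title element renders
collect(parse());         // run the parser code below and emit the record

// Parser code (sketch): extract fields with the $() selector API.
return {
  title: $('#product-title').text().trim(),
  price: $('.price').first().text().trim(),
};
```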
What are the options to initiate requests?
There are three options for initiating requests (an API sketch follows the list):
- Initiate by API: regular request, queue request, and replace request.
- Initiate manually.
- Schedule mode.
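As a sketch of the API option, a trigger request might look like the following. The endpoint path, collector parameter, and token handling are illustrative; copy the exact request from the ‘Initiate by API’ panel in your control panel:

```js
// Sketch: trigger a scraper run over HTTPS (Node 18+, built-in fetch).
// Endpoint shape and parameter names are illustrative.
const res = await fetch(
  'https://api.brightdata.com/dca/trigger?collector=YOUR_SCRAPER_ID',
  {
    method: 'POST',
    headers: {
      Authorization: 'Bearer YOUR_API_TOKEN',
      'Content-Type': 'application/json',
    },
    // The body carries the input records the scraper will run with.
    body: JSON.stringify([{ url: 'https://example.com/product/123' }]),
  }
);
console.log(await res.json()); // typically returns an ID for fetching results
```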
How to start using the Web Scraper?
There are two ways to use the data collection tool: self-managed, where you build and run scrapers yourself in the IDE, or managed, where Bright Data’s team builds and maintains the scraper for you.
What is a queue request?
When you send more than one API request, a “queue request” means that the next request starts automatically once the previous one completes, and so on for all remaining requests.
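For example, assuming the trigger endpoint from the sketch above and a queue flag (shown here as the illustrative queue_next=1; check the ‘Initiate by API’ panel for the exact name), queuing a second request could look like this:

```js
// Sketch: queue this run to start automatically after the current one.
// The queue_next=1 flag name is illustrative.
await fetch(
  'https://api.brightdata.com/dca/trigger?collector=YOUR_SCRAPER_ID&queue_next=1',
  {
    method: 'POST',
    headers: {
      Authorization: 'Bearer YOUR_API_TOKEN',
      'Content-Type': 'application/json',
    },
    body: JSON.stringify([{ url: 'https://example.com/product/456' }]),
  }
);
```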
What is a CPM?
1 CPM = 1,000 page loads.
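For example, a job that triggers 25,000 page loads consumes 25 CPM; at an illustrative rate of $3 per CPM, that job would cost 25 × $3 = $75.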
When building a scraper, what is considered as a billable event?
Billable events (a sketch follows the list):
- navigate()
- request()
- load_more()
- media file download (coming later)
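As a sketch, here is how those events appear in IDE interaction code; each call below is billed, while purely local work such as parsing is not (the selectors, URLs, and exact signatures are illustrative, so verify them against the IDE command reference):

```js
// Each of these IDE commands is a billable event:
navigate('https://example.com/category/shoes');   // page load: billable
load_more('#results');                            // loads more results: billable
const extra = request('https://example.com/api/items?page=2'); // HTTP request: billable

// Parsing the already-loaded page is local work and is not billed:
collect(parse());
```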
How can I confirm that someone is working on the new Web Scraper I requested?
You’ll receive an email confirming that a developer is working on your new Web Scraper, and you’ll be notified when your scraper is ready. The status of the request can also be found on your dashboard.
How to report an issue on the Web Scraper IDE?
You can use this form to communicate any issues you have with the platform, the scraper, or the dataset results.
Tickets are assigned to different departments depending on the selected issue type, so please make sure to choose the most relevant type.
- Select a job ID: the job whose dataset has the issue.
- Select the type of issue:
Data
This option is only available for managed scrapers. Tickets are sent directly to your scraper engineer.
- Missing fields
- Missing records
- Missing values
- Parsing issues: the dataset results are incorrect
Collection and Delivery
These tickets are addressed to our support agents.
- Incomplete delivery: something went wrong during the delivery
- Scraper is slow: the scraper is collecting results slowly or is stuck
Other
These tickets are addressed to your account manager.
- UI issue: the UI does not operate correctly
- Product question: general questions about using the Web Scraper product
- Something else is going wrong
- (Parsing issues) Use the red “bug” icon to indicate where the incorrect results are.
- (Parsing issues) Enter the results you expect to receive.
- Write a description of what went wrong and the URL where the data is collected.
- If needed, attach an image to support your report.
I updated the input/output schema of my managed scraper. Can I use it while Bright Data updates my scraper?
When the input/output schema is updated, the scraper needs to be updated to match the new schema. If the scraper is still being worked on and not yet updated, you’ll see an ‘Incompatible input/output schema’ error.
If you want to trigger it anyway, ignoring the schema change, click ‘Trigger anyway’ in the UI. Via the API, add the relevant parameter when triggering the scraper (a sketch follows the list):
- Output schema incompatible: override_incompatible_schema=1
- Input schema incompatible: override_incompatible_input_schema=1
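For example, using the illustrative trigger endpoint from the earlier sketches, overriding an incompatible output schema could look like this (the override parameter names come from the list above; the endpoint shape is an assumption):

```js
// Sketch: trigger despite an incompatible output schema.
// Use override_incompatible_input_schema=1 instead for input-schema errors.
await fetch(
  'https://api.brightdata.com/dca/trigger' +
    '?collector=YOUR_SCRAPER_ID&override_incompatible_schema=1',
  {
    method: 'POST',
    headers: {
      Authorization: 'Bearer YOUR_API_TOKEN',
      'Content-Type': 'application/json',
    },
    body: JSON.stringify([{ url: 'https://example.com/product/123' }]),
  }
);
```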
How can I debug real time scrapers?
We store the last 1,000 errors inside the virtual job record so you can see example inputs that failed (there’s a Control Panel button to view the errors in the IDE).
You should already know which inputs failed because you received an ‘error’ response for them. You can re-run these manually in the IDE to see what happened, much like providing a cURL request example when the unblocker isn’t behaving correctly.
What should I do if I face an issue with a Web Scraper?
Select “Report an issue” from the Bright Data Control Panel. Once you report your issue, a ticket is automatically assigned to one of our 14 developers, who monitor all tickets on a daily basis. Make sure to provide details of the problem; if you are not sure, please contact your account manager. Once you report an issue, you don’t need to do anything else, and you’ll receive an email confirming that the issue was reported.
When “reporting an issue”, what information should I include in my report?
Please provide the following information when reporting an issue:
- Select the type of problem you’re facing (for example: getting the wrong results, missing data points, results never loaded, delivery issue, UI issue, scraper is slow, IDE issue, other)
- Describe in detail the problem you are facing
- You may upload a file that illustrates the problem
After you report an issue, we’ll automatically open a ticket that will be promptly handled by our R&D department.
What is a Data Collector?
In the past, we referred to all of our scraping tools as “Collectors.” A Collector is essentially a web scraper that consists of both interaction code and parser code. It can operate as an HTTP request or in a real browser, with all requests routed through our unlocker network to prevent blocking.
Over time, we developed a Dataset Unit that builds on top of one or more Collectors. For example, with a single Collector (direct request), you can scrape a specific URL—such as a product page from an e-commerce site—and receive the parsed data. In more complex scenarios, multiple Collectors can work together, such as when discovering and scraping categories, followed by collecting data on every product within those categories.
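As a rough sketch of that multi-Collector pattern, a discovery stage might hand each product URL to a collection stage. The next_stage() call is assumed here as the IDE’s stage-chaining command, and the parser fields are hypothetical; verify both against the IDE command reference:

```js
// Stage 1, discovery (sketch): gather product links from a category page
// and queue a stage-2 run for each one.
navigate(input.category_url);
const { product_links } = parse();   // parser code returns the link list
for (const url of product_links) {
  next_stage({ url });               // one collection run per product URL
}

// Stage 2, collection (sketch): scrape and emit each product page.
navigate(input.url);
collect(parse());
```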
How to Create a Data Collector?
You have a few options to create and configure a Data Collector:
- Using the Web Scraper IDE: You can design and structure your parser as individual Collectors or as a single Collector with multiple steps. To get started:
  - Click on the “Web Data Collection” icon on the right.
  - Navigate to the “My Scrapers” tab.
  - Select the “Develop a Web Scraper (IDE)” button.
From here, you can build from scratch or explore available templates for guidance. Start here: Create a Data Collector
- Requesting a Custom Dataset: If you prefer us to handle it, you can request a custom dataset, and we’ll create the Data Collectors needed to deliver it. To do this, click on the “Request Datasets” button under the “My Datasets” tab and choose the option that best suits your needs. Start here: Request a Custom Dataset
Any system limitations?
We have a limitation of 100 parallel-running jobs. When more than 100 jobs are triggered, the additional jobs are placed in a queue and wait until the earlier ones finish.