How to Set Up Bright Data With Octoparse
Boost your web scraping efficiency by integrating Bright Data with Octoparse, ensuring secure and anonymous data extraction while reducing the risk of IP blocks.
Expand to get your Bright Data Proxy Access Information
Expand to get your Bright Data Proxy Access Information
Your proxy access information
Bright Data proxies are grouped in “Proxy zones”. Each zone holds the configuration for the proxies it holds.
To get access to the proxy zone:
- Login to Bright Data control panel
- Select the proxy zone or setup a new one
- Click on the new zone name, and select the Overview tab.
- In the overview tab, under Access details you can find the proxy access details, and copy them to clipboard on click.
- You will need: Proxy Host, Proxy Port, Proxy Zone username and Proxy Zone password.
- Click on the copy icons to copy the text to your clipboard and paste in your tool’s proxy configuration.
Access Details Section Example
Residential proxy access
To access Bright Data’s Residential Proxies you will need to either get verified by our compliance team, or install a certificate. Read more…
Targeting search engines?
If you target a search engine like google, bing or yandex, you need a special Search Engine Results Page (SERP) proxy API. Use Bright Data SERP API to target search engines. Click here to read more about Bright Data SERP proxy API.
Correct setup of proxy test to avoid “PROXY ERROR”
In many tools you will see a “test proxy” function, which performs a conncectivity test to your proxy, and some add a geolocation test as well, to identify the location of the proxy.
To correctly test your proxy you should target those search queries to:
https://geo.brdtest.com/welcome.txt
.
Some tools use popular search engines (like google.com) as a default test target. Bright Data will block those requests and you tool will show proxy error although your proxy is perfectly fine.
If your proxy test fails, this is probably the reason. Make sure that your test domain is not a search engine (this is done in the tool configuration, and not controlled by Bright Data).
What is Octoparse?
Octoparse is a user-friendly web scraping tool that allows you to collect data from websites without needing any coding knowledge. With its simple point-and-click interface, Octoparse enables you to extract information from even the most complex sites. It offers the flexibility to customize, automate, and schedule scraping tasks, saving the extracted data in formats such as CSV or Excel. Perfect for market research, price tracking, or lead generation, Octoparse makes data collection fast, easy, and efficient!
Octoparse Proxy Integration
Follow these simple steps to integrate Bright Data proxies with Octoparse:
Install Octoparse
Visit the Octoparse website to download and install the tool.
Create a New Task
Click the +New button in the top-left corner, then select Custom Task.
Enter the Target URL
In the URL Input field, enter the URL of the website you wish to scrape, then click Save.
Access Proxy Settings
Once the page loads, navigate to Task Settings > Anti-blocking.
Enable Proxy Usage
Check Access websites via proxies and select Use my own proxies. Then click Configure.
Configure Your Bright Data Proxy
In the pop-up window, enter your Bright Data proxy details in the following format:
- IP/host: Enter
http://brd.superproxy.io/
. - Port: Use the port number provided in your Bright Data dashboard.
- Username: Enter your Bright Data proxy
username
. - Password: Enter your Bright Data proxy
password
.
For country-specific proxies, you can enter a format like your-username-country-US
to receive a US exit node.
If you’re using rotating proxies, set the Switch interval to specify how often the IPs should rotate. For sticky sessions, adjust it according to your preferred session length.
Save Your Settings
Click Confirm to apply the changes, then click Save.
And that’s it! You’ve now successfully integrated Bright Data proxies with Octoparse.