Brightdata

Learn how to use Brightdata with Composio

Overview

Enum

BRIGHTDATA

Description

Bright Data is a web data platform that offers Web Unlocker for bypassing anti-bot systems, a SERP API for search engine data, and pre-made scrapers for popular websites, so you can collect web data at scale.

Authentication Details

api_key
string (required)

Actions

Tool to trigger a site crawl job to extract content across multiple pages or entire domains. Use when you need to start a crawl for a given dataset and list of URLs.

Action Parameters

custom_output_fields
string
dataset_id
string (required)
include_errors
boolean
items
array (required)
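
The parameters above can be assembled and checked before the action is called. The following is a minimal sketch, not Composio's or Bright Data's own client code: the helper name build_crawl_arguments is hypothetical, the shape of each entry in items (an object with a "url" key) is an assumption, and "gd_example" is a placeholder dataset ID.

```python
from typing import Any, Optional


def build_crawl_arguments(
    dataset_id: str,
    urls: list[str],
    include_errors: Optional[bool] = None,
    custom_output_fields: Optional[str] = None,
) -> dict[str, Any]:
    """Assemble the argument dict for the crawl-trigger action (hypothetical helper)."""
    # dataset_id and items are the two required parameters.
    if not dataset_id:
        raise ValueError("dataset_id is required")
    if not urls:
        raise ValueError("items is required and must not be empty")

    # Assumption: each item is an object with a "url" key.
    arguments: dict[str, Any] = {
        "dataset_id": dataset_id,
        "items": [{"url": url} for url in urls],
    }

    # Optional parameters are only sent when the caller sets them.
    if include_errors is not None:
        arguments["include_errors"] = include_errors
    if custom_output_fields is not None:
        arguments["custom_output_fields"] = custom_output_fields
    return arguments


# "gd_example" is a placeholder, not a real dataset ID.
crawl_arguments = build_crawl_arguments("gd_example", ["https://example.com"])
```

Leaving optional keys out, rather than sending them as null, lets the action apply whatever defaults it defines.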

Action Response

data
object
error
string
successful
boolean
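
Every action on this page returns the same three response fields (data, error, successful). A small wrapper can turn that envelope into either a value or an exception. This is a sketch under the assumption that the response arrives as a plain dict; ActionError and unwrap_response are hypothetical names.

```python
from typing import Any


class ActionError(RuntimeError):
    """Raised when an action response reports successful = False (hypothetical)."""


def unwrap_response(response: dict[str, Any]) -> Any:
    """Return the data field of a response, or raise if the action failed."""
    if not response.get("successful", False):
        # Fall back to a generic message when error is missing or empty.
        raise ActionError(response.get("error") or "action failed without an error message")
    return response.get("data")


payload = unwrap_response({"data": {"snapshot_id": "s_example"}, "error": None, "successful": True})
```
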

Tool to list all available pre-made scrapers (datasets) from Bright Data's marketplace. Use when you need to browse available data sources for structured scraping.

Action Parameters

Action Response

data
object
error
string
successful
boolean

Tool to apply custom filter criteria to a marketplace dataset (beta). Use after selecting a dataset to generate a filtered snapshot.

Action Parameters

dataset_id
string (required)
files
array
filter
object (required)
records_limit
integer

Action Response

data
object
error
string
successful
boolean

Tool to get available static network cities for a given country. Use when you need to configure static proxy endpoints after selecting a country.

Action Parameters

country
string (required)
pool_ip_type
string (default: dc)
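
As a sketch of how these two parameters might be prepared, the helper below checks that country looks like a two-letter code and applies the documented default of dc for pool_ip_type. The helper name is hypothetical, and the code is passed through unchanged because this page does not say which letter case the action expects.

```python
import re


def build_cities_arguments(country: str, pool_ip_type: str = "dc") -> dict[str, str]:
    """Assemble arguments for the static-network-cities action (hypothetical helper)."""
    code = country.strip()
    # Two ASCII letters; this checks the shape only, not that the code is assigned.
    if not re.fullmatch(r"[A-Za-z]{2}", code):
        raise ValueError(f"country must be a two-letter code, got {country!r}")
    return {"country": code, "pool_ip_type": pool_ip_type}


cities_arguments = build_cities_arguments("us")
```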

Action Response

data
object
error
string
successful
boolean

Tool to list available countries and their ISO 3166-1 alpha-2 codes. Use when you need to configure zones with valid country codes before provisioning proxies.

Action Parameters

Action Response

data
object
error
string
successful
boolean

Tool to retrieve the scraped data from a completed crawl job by snapshot ID. Use after triggering a crawl or filtering a dataset to download the collected data.

Action Parameters

format
string (default: json)
limit
integer
offset
integer
snapshot_id
string (required)

Action Response

content
string
data
error
string
successful
boolean

Tool to check the processing status of a crawl job using the snapshot ID. Call before attempting to download results to ensure data collection is complete.

Action Parameters

snapshot_id
string (required)
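
Checking status before downloading is usually done in a polling loop. The sketch below is one way to write it, with stated assumptions: get_status is a hypothetical callable that runs the status action and returns a status string, and the values "ready" and "failed" are assumed rather than taken from this page.

```python
import time
from typing import Callable


def wait_until_ready(
    get_status: Callable[[str], str],
    snapshot_id: str,
    interval_seconds: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll a snapshot's status until it is ready, fails, or attempts run out."""
    for attempt in range(max_attempts):
        status = get_status(snapshot_id)
        if status == "ready":  # assumed terminal success value
            return status
        if status == "failed":  # assumed terminal failure value
            raise RuntimeError(f"snapshot {snapshot_id} failed")
        # Wait between checks, but not after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval_seconds)
    raise TimeoutError(f"snapshot {snapshot_id} not ready after {max_attempts} checks")


# Demonstration with canned statuses and a no-op sleep.
_statuses = iter(["running", "running", "ready"])
final_status = wait_until_ready(lambda _id: next(_statuses), "s_example", sleep=lambda _s: None)
```

Passing sleep as a parameter keeps the loop testable without real delays.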

Action Response

data
object
error
string
successful
boolean

Tool to list your configured Web Unlocker zones and proxy endpoints. Use to view available zones for web scraping and bot protection bypass.

Action Parameters

Action Response

data
object
error
string
successful
boolean

Tool to bypass bot detection, CAPTCHA, and other anti-scraping measures to extract content from websites. Use when you need to scrape websites that block automated access or require JavaScript rendering.

Action Parameters

country
string
device
string (default: desktop)
format
string (default: html)
render_js
boolean (default: True)
timeout
integer (default: 30)
url
string (required)
wait_for
string
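
To make the defaults above concrete, here is a minimal sketch of a helper that fills them in and validates the required url. The helper name is hypothetical, the URL check is an added safeguard rather than documented behaviour, and "#main" in the usage line is a placeholder since this page does not describe what wait_for accepts.

```python
from typing import Any, Optional
from urllib.parse import urlparse


def build_unlocker_arguments(
    url: str,
    country: Optional[str] = None,
    device: str = "desktop",
    output_format: str = "html",
    render_js: bool = True,
    timeout: int = 30,
    wait_for: Optional[str] = None,
) -> dict[str, Any]:
    """Assemble arguments for the web-unlocker action (hypothetical helper)."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"url must be an absolute http(s) URL, got {url!r}")
    if timeout <= 0:
        raise ValueError("timeout must be positive")

    # Defaults mirror the documented ones: desktop, html, True, 30.
    arguments: dict[str, Any] = {
        "url": url,
        "device": device,
        "format": output_format,
        "render_js": render_js,
        "timeout": timeout,
    }
    # country and wait_for have no documented default, so omit them when unset.
    if country is not None:
        arguments["country"] = country
    if wait_for is not None:
        arguments["wait_for"] = wait_for
    return arguments


unlocker_arguments = build_unlocker_arguments("https://example.com", country="us", wait_for="#main")
```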

Action Response

data
object
error
string
successful
boolean