brightdata-mcp

Local
Community
2.0k
Signed
GitHub Repo

Overview

The brightdata-mcp server is a Model Context Protocol (MCP) server that enables AI assistants and agents to access real-time web data through Bright Data’s global proxy and scraping infrastructure. It allows AI workflows to fetch live web content, extract structured data, and perform large-scale web data collection without building or maintaining custom scraping systems.

This makes it ideal for research, market analysis, competitive intelligence, and any AI-driven workflow that depends on fresh, publicly available web data.

Transport

stdio

Tools

  • web_data_amazon_product
  • web_data_walmart_product
  • web_data_google_shopping
  • web_data_linkedin_posts
  • web_data_tiktok_posts
  • web_data_youtube_videos
  • scraping_browser_snapshot
  • scraping_browser_click_ref
  • scraping_browser_screenshot
  • web_data_yahoo_finance_business
  • web_data_crunchbase_company
  • web_data_zoominfo_company_profile
  • web_data_zillow_properties_listing
  • web_data_github_repository_file
  • web_data_reuter_news
  • web_data_google_play_store
  • web_data_apple_app_store
  • web_data_booking_hotel_listings
  • search_engine_batch
  • scrape_batch
  • extract

Key Capabilities

  • Real-time web access — Fetch up-to-date content from websites around the world.
  • Global proxy infrastructure — Leverage Bright Data’s residential, datacenter, ISP, and mobile IP networks with geo targeting.
  • Structured data extraction — Convert unstructured web pages into machine-readable data for analysis.
  • Scalable data collection — Run large or repeated data collection tasks without manual scraping logic.
  • Anti-blocking resilience — Handle CAPTCHAs, rate limits, and blocking through Bright Data’s managed infrastructure.

How It Works

The brightdata-mcp-server runs as an MCP service that brokers requests between AI clients and Bright Data’s APIs. When an AI assistant invokes a tool like fetch_url or extract_data, the server routes the request through Bright Data’s proxy and scraping platform, applying the appropriate network type, geolocation, and access rules.

Authentication is handled using a Bright Data API token configured as an environment variable. The server abstracts away proxy management, retries, and anti-blocking logic, returning clean, structured results to the AI client over the MCP protocol.

This design lets AI assistants incorporate live web intelligence into their reasoning — such as checking current prices, monitoring competitors, gathering research data, or validating information — without embedding brittle scraping code or relying on static datasets. Once configured, Bright Data becomes a native, on-demand data source inside any MCP-compatible AI workflow.