MCP Server for Oxylabs Scraper
A Model Context Protocol (MCP) server that enables AI assistants like Claude to seamlessly access web data through Oxylabs' powerful web scraping technology.
π Overview
The Oxylabs MCP server provides a bridge between AI models and the web. It enables them to scrape any URL, render JavaScript-heavy pages, extract and format content for AI use, bypass anti-scraping measures, and access geo-restricted web data from 195+ countries.
This implementation leverages the Model Context Protocol (MCP) to create a secure, standardized way for AI assistants to interact with web content.
β¨ Key Features
Scrape content from any site- Extract data from any URL, including complex single-page applications
- Fully render dynamic websites using headless browser support
- Choose full JavaScript rendering, HTML-only, or none
- Emulate Mobile and Desktop viewports for realistic rendering
- Automatically clean and convert HTML to Markdown for improved readability
- Use automated parsers for popular targets like Google, Amazon, and etc.
- Bypass sophisticated bot protection systems with high success rate
- Reliably scrape even the most complex websites
- Get automatically rotating IPs from a proxy pool covering 195+ countries
- Set rendering and parsing options if needed
- Feed data directly into AI models or analytics tools
- Works on macOS, Windows, and Linux
- Comprehensive error handling and reporting
- Smart rate limiting and request management
π‘ Example Queries
When you've set up the MCP server with Claude, you can make requests like:
- Could you scrape
https://www.google.com/search?q=ai
page? - Scrape
https://www.amazon.de/-/en/Smartphone-Contract-Function-Manufacturer-Exclusive/dp/B0CNKD651V
with parse enabled - Scrape
https://www.amazon.de/-/en/gp/bestsellers/beauty/ref=zg_bs_nav_beauty_0
with parse and render enabled - Use web unblocker with render to scrape
https://www.bestbuy.com/site/top-deals/all-electronics-on-sale/pcmcat1674241939957.c
β Prerequisites
Before you begin, make sure you have:
- Oxylabs Account: Obtain your username and password from Oxylabs (1-week free trial available)
Basic Usage
Via Smithery CLI:
- Node.js (v16+)
npx
command-line tool
Via uv:
uv
package manager β install it using this guide
Local/Dev Setup
- Python 3.12+
uv
package manager β install it using this guide
π§© API Parameters
The Oxylabs MCP server supports these parameters:
Parameter | Description | Values |
---|---|---|
url |
The URL to scrape | Any valid URL |
parse |
Enable structured data extraction | True or False |
render |
Use headless browser rendering | html or None |
βοΈ Basic Setup Instructions
Install via Smithery
Automatically install Oxylabs MCP server for Claude Desktop via Smithery:
npx -y @smithery/cli install @oxylabs/oxylabs-mcp --client claude
Install using uv in Claude Desktop
With uv
installed, this method will automatically set up the Oxylabs MCP server in Claude Desktop. Navigate to Claude β Settings β Developer β Edit Config and edit your claude_desktop_config.json
file as follows:
{
"mcpServers": {
"oxylabs_scraper": {
"command": "uvx",
"args": ["oxylabs-mcp"],
"env": {
"OXYLABS_USERNAME": "YOUR_USERNAME_HERE",
"OXYLABS_PASSWORD": "YOUR_PASSWORD_HERE"
}
}
}
}
[!TIP]If you run into errors, try using the full path to
uvx
in thecommand
field. For example,/Users/my-user/.local/bin/uvx
.
π» Local/Dev Setup Instructions
Clone repository
git clone <git:url>
Install dependencies
Install MCP server dependencies:
cd mcp-server-oxylabs
# Create virtual environment and activate it
uv venv
source .venv/bin/activate # MacOS/Linux
# OR
.venv/Scripts/activate # Windows
# Install dependencies
uv sync
Setup with Claude Desktop
Navigate to Claude β Settings β Developer β Edit Config and edit your claude_desktop_config.json
file as follows:
{
"mcpServers": {
"oxylabs_scraper": {
"command": "uv",
"args": [
"--directory",
"/<Absolute-path-to-folder>/oxylabs-mcp",
"run",
"oxylabs-mcp"
],
"env": {
"OXYLABS_USERNAME": "YOUR_USERNAME_HERE",
"OXYLABS_PASSWORD": "YOUR_PASSWORD_HERE"
}
}
}
}
π Debugging
make run
Then access MCP Inspector at http://localhost:5173
. You may need to add your username and password as environment variables in the inspector under OXYLABS_USERNAME
and OXYLABS_PASSWORD
.
π οΈ Technical Details
This server provides two main tools:
- oxylabs_scraper: Uses Oxylabs Web Scraper API for general website scraping
- oxylabs_web_unblocker: Uses Oxylabs Web Unblocker for hard-to-access websites
Web Scraper API supports JavaScript rendering, parsed structured data, and cleaned HTML in Markdown format. Web Unblocker offers JavaScript rendering and cleaned HTML, but doesnβt return parsed data.
[!WARNING]Usage with the MCP Inspector is affected by an ongoing issue with the Python SDK for MCP, see: https://github.com/modelcontextprotocol/python-sdk/pull/85. For Claude, a forked version of the SDK is used as a temporary fix.
License
This project is licensed under the MIT License.
About Oxylabs
Established in 2015, Oxylabs is a market-leading web intelligence collectionplatform, driven by the highest business, ethics, and compliance standards,enabling companies worldwide to unlock data-driven insights.