aaronjmars

OpenDia

Community aaronjmars
Updated

Connect your browser to AI models. Just use Dia on Chrome, Arc or Firefox.

OpenDia

The open alternative to Dia / Perplexity Comet Connect your browser to AI models.No browser switching neededβ€”works seamlessly with Chrome, Firefox, and any Chromium browser. Private, local-first & MCP focused.If you are not technical / never used MCPs before, we recommend using Perplexity Comet.

npm versionGitHub releaseLicense: MIT

πŸ“Ί See it in Action

OpenDia Demo

πŸš€ What is OpenDia?

OpenDia lets AI models control your browser automatically. The key advantage? It leverages everything you already haveβ€”your logged-in accounts, saved passwords, cookies, wallets, and browsing history. No need to start from scratch or switch contexts.

πŸ”‘ Use Your Existing Digital Life:

  • βœ… Logged-in accounts: Post to Twitter/X, LinkedIn, Facebook with your existing sessions
  • βœ… Browser data: Access your bookmarks, history, and saved passwords
  • βœ… Extensions & wallets: Use MetaMask, password managers, or any installed extensions
  • βœ… Cookies & sessions: Stay authenticated across all your favorite sites
  • βœ… Local testing: Perfect for development with Cursor - test with real user sessions

✨ Key Benefits:

  • πŸ”„ Universal AI Support: Works with Claude, ChatGPT, Cursor and even local models
  • 🎯 Anti-Detection: Specialized bypasses for Twitter/X, LinkedIn, Facebook
  • πŸ“± Smart Automation: AI understands your pages and finds the right elements
  • πŸ›‘οΈ Privacy-First: Everything runs locally, your data stays with you
  • ⚑ Zero Setup: Get started with one command

🌐 Browser Support

Works with Chrome, Firefox, and any Chromium-based browser:

  • βœ… Google Chrome
  • βœ… Arc
  • βœ… Mozilla Firefox
  • βœ… Microsoft Edge
  • βœ… Brave
  • βœ… Opera
  • βœ… Any Chromium based browser

Also perfect for Cursor users who want to automate their local testing and development workflows!

🎬 What You Can Do

Real workflows you can try today:

πŸ“° Content & Social Media

  • "Summarize all the articles I read today and post a Twitter thread about the key insights"
  • "Find interesting articles related to AI from my bookmarks and create a reading list"
  • "Read this article and post a thoughtful comment on the LinkedIn version"
  • "Check my recent Twitter bookmarks and summarize the main themes"

πŸ“§ Productivity & Research

  • "Browse my latest emails and tell me what needs urgent attention"
  • "Find all the GitHub repos I visited this week and create a summary report"
  • "Extract the main points from this research paper and save them to my notes"
  • "Search my browsing history for that article about AI safety I read last month"

πŸ€– Development & Testing (Perfect for Cursor!)

  • "Test my web app's signup flow and take screenshots at each step"
  • "Fill out this form with test data and check if validation works"
  • "Navigate through my app and check if all the buttons work properly"
  • "Use my connected wallet to test this DeFi interface"

πŸ”„ Advanced Automation

  • "Open tabs for all my daily news sources and summarize the top stories"
  • "Draft replies to my unread messages based on the context"
  • "Monitor this webpage and notify me when the content changes"
  • "Automatically bookmark interesting articles I'm reading"

🎨 Visual Customization & Fun

  • "Apply a cyberpunk theme to this documentation site to make it more engaging"
  • "Make this page dark mode with green text for late-night reading"
  • "Add rainbow party effects to celebrate finishing this project"
  • "Transform this boring form with a retro 80s theme while I fill it out"
  • "Use high contrast styling so I can read this better"

⚑ Quick Start

1. Install the Browser Extension

For Chrome/Chromium browsers:

  1. Download opendia-chrome-1.0.6.zip from releases
  2. Extract the zip file to a folder
  3. Go to chrome://extensions/ (or your browser's extension page)
  4. Enable "Developer mode"
  5. Click "Load unpacked" and select the extracted folder

For Firefox:

  1. Download opendia-firefox-1.0.6.zip from releases
  2. Extract the zip file to a folder
  3. Go to about:debugging#/runtime/this-firefox
  4. Click "Load Temporary Add-on..."
  5. Select the manifest.json file from the extracted folder

Note: Firefox extensions are loaded as temporary add-ons and will be removed when Firefox restarts. This is a Firefox limitation for unsigned extensions.

2. Connect to Your AI

Option 1: Double-click Installation (Recommended)

  1. Download the opendia.dxt file from releases
  2. Double-click the .dxt file to install automatically
  3. The MCP will be added to your Claude Desktop configuration

Option 2: Manual ConfigurationAdd to your Claude Desktop configuration:

{
  "mcpServers": {
    "opendia": {
      "command": "npx",
      "args": ["opendia"]
    }
  }
}

For Cursor or other AI tools, use the same configuration or follow their specific setup instructions.

Usage Modes

Local Mode (Default)

npx opendia
  • Chrome extension: ws://localhost:5555 (auto-discovery enabled)
  • Claude Desktop: stdio (existing config)
  • Local SSE: http://localhost:5556/sse

Port Configuration

# Use custom ports
npx opendia --port=6000              # Uses 6000 (WebSocket) + 6001 (HTTP)
npx opendia --ws-port=5555 --http-port=5556  # Specify individually

# Handle port conflicts  
# Note: Existing OpenDia processes are automatically terminated on startup

Auto-Tunnel Mode

npx opendia --tunnel
  • Automatically creates ngrok tunnel
  • Copy URL for ChatGPT/online AI services
  • Local functionality preserved

Note: For auto-tunneling to work, you need ngrok installed:

macOS:

brew install ngrok

Windows:

# Using Chocolatey
choco install ngrok

# Or download from https://ngrok.com/download

Linux:

# Ubuntu/Debian
curl -s https://ngrok-agent.s3.amazonaws.com/ngrok.asc | sudo tee /etc/apt/trusted.gpg.d/ngrok.asc >/dev/null
echo "deb https://ngrok-agent.s3.amazonaws.com buster main" | sudo tee /etc/apt/sources.list.d/ngrok.list
sudo apt update && sudo apt install ngrok

# Or download from https://ngrok.com/download

Then get your free authtoken from https://dashboard.ngrok.com/get-started/your-authtoken and run:

ngrok config add-authtoken YOUR_TOKEN_HERE

πŸ› οΈ Capabilities

OpenDia gives AI models 18 powerful browser tools:

🎯 Smart Page Understanding

  • Analyze any webpage - AI automatically finds buttons, forms, and interactive elements
  • Extract content intelligently - Get clean text from articles, social posts, or search results
  • Understand context - AI knows what type of page it's looking at and how to interact with it

πŸ–±οΈ Natural Interactions

  • Click anything - Buttons, links, menus - AI finds and clicks the right elements
  • Fill forms smartly - Works even on complex sites like Twitter, LinkedIn, Facebook
  • Navigate seamlessly - Go to pages, scroll, wait for content to load
  • Handle modern web apps - Bypasses detection on social platforms

πŸ“‘ Tab & Window Management

  • Multi-tab workflows - Open, close, switch between tabs automatically
  • Organize your workspace - Let AI manage your browser tabs efficiently
  • Coordinate complex tasks - Work across multiple sites simultaneously

πŸ“Š Access Your Browser Data

  • Bookmarks & History - Find that article you read last week
  • Current page content - Get selected text, links, or full page content
  • Real-time information - Work with whatever's currently on your screen

πŸ›‘οΈ Anti-Detection Features

  • Social media posting - Bypass automation detection on Twitter/X, LinkedIn, Facebook
  • Natural interactions - Mimics human behavior to avoid triggering security measures
  • Reliable automation - Works consistently even on sites that block typical automation tools

🎨 Page Styling & Customization

  • Transform any website - Apply fun themes, custom colors, and visual effects
  • Preset themes - Dark hacker, retro 80s, rainbow party, minimalist zen, and more
  • AI mood styling - Describe a mood and get matching visual design
  • Interactive effects - Matrix rain, floating particles, neon glow, and cursor trails
  • Accessibility themes - High contrast and readable designs for better visibility

πŸ’¬ Example Prompts to Try

Once everything is set up, try asking your AI:

Content Creation:

"Read the article on this page and create a Twitter thread summarizing the main points"

Research & Analysis:

"Look through my browser history from this week and find articles about machine learning. Summarize the key trends."

Social Media Management:

"Check my Twitter bookmarks and organize them into categories. Create a summary of each category."

Productivity:

"Open tabs for my usual morning reading sites and give me a briefing of today's top stories"

Development Testing:

"Fill out this contact form with test data and check if the submission works properly"

Personal Assistant:

"Find that GitHub repo I was looking at yesterday about React components and bookmark it for later"

Page Styling & Fun:

"Apply a dark hacker theme to this page to make it look more interesting"

"Make this boring documentation page feel like a cozy coffee shop"

"Add some matrix rain effects to this page for 30 seconds for a cool screenshot"

"Transform this page with a high contrast theme for better readability"

πŸ—οΈ How It Works

graph LR
    A[AI Model] --> B[OpenDia Server]
    B --> C[Browser Extension]
    C --> D[Your Browser]
    D --> E[Any Website]
  1. You ask your AI to do something browser-related
  2. AI calls OpenDia tools to understand and interact with pages
  3. OpenDia controls your browser through the extension
  4. You get results - AI can see what happened and respond intelligently

πŸ”’ Security & Privacy

Your data stays private:

  • βœ… Everything runs locally - No cloud processing of your browsing data
  • βœ… You control access - Extension only works when you want it to
  • βœ… Open source - Full transparency of what the code does
  • βœ… No tracking - We don't collect or store any of your information

Important: This tool requires broad browser permissions to function. Only use with AI models you trust, and in environments where you're comfortable with browser automation.

🀝 Contributing

Love to have your help making OpenDia better!

Quick Development Setup

git clone https://github.com/aaronjmars/opendia.git
cd opendia

# Start the server
cd opendia-mcp
npm install
npm start

# Load extension in your browser  
# Chrome: Go to chrome://extensions/ β†’ Developer mode β†’ Load unpacked: ./opendia-extension/dist/chrome
# Firefox: Go to about:debugging#/runtime/this-firefox β†’ Load Temporary Add-on β†’ ./opendia-extension/dist/firefox/manifest.json
# Extension will auto-connect to server on localhost:5555

Ways to Contribute

  • πŸ› Report bugs via GitHub Issues
  • πŸ’‘ Share it on social medias
  • πŸ”§ Add new browser capabilities
  • πŸ“– Improve documentation
  • πŸ§ͺ Test with different AI models

πŸ“ License

MIT License - see LICENSE for details.

Ready to supercharge your browser with AI? Get started with npx opendia! πŸš€

MCP Server Β· Populars

MCP Server Β· New

    Southclaws

    Storyden

    With a fresh new take on traditional bulletin board forum software, Storyden is a modern, secure and extensible platform for building communities.

    Community Southclaws
    SocketDev

    Socket MCP Server

    MCP Server

    Community SocketDev
    containers

    Kubernetes MCP Server

    Model Context Protocol (MCP) server for Kubernetes and OpenShift

    Community containers
    microsoft

    fabric-rti-mcp

    MCP server for Fabric Real-Time Intelligence (https://aka.ms/fabricrti) supporting tools for Eventhouse (https://aka.ms/eventhouse), Azure Data Explorer (https://aka.ms/adx, and other RTI services (coming soon)

    Community microsoft
    PaddlePaddle

    PaddleOCR MCP ζœεŠ‘ε™¨

    Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

    Community PaddlePaddle