Xiaozhi AI voice agent MCP bridge โ€” 58 desktop tools for full autonomous PC control

๐Ÿฆž PURPCLAW v7.0 โ€” ULTIMATE

The AI Voice Agent MCP Bridge โ€” Gives your Xiaozhi AI hardware full autonomous PC control via 58 MCP tools. Built by WEEMADSCOTSMAN at Pixel Dynasty.

๐Ÿš€ New Here? Read the Full Setup Guide โ†’

The setup guide covers everything from buying the hardware to your first voice command. Anyone with a Xiaozhi ball can use this.

What Is This?

PURPCLAW is an MCP (Model Context Protocol) bridge that connects a Xiaozhi AI voice agent to your Windows PC. It turns a voice-enabled AI ball into a fully autonomous engineering assistant that can:

  • ๐Ÿ–ฅ๏ธ See your screen โ€” Screenshots + OCR text recognition
  • ๐Ÿ“ธ See YOU โ€” Webcam capture, face/object detection
  • ๐ŸŒ Browse the web โ€” Playwright-powered: click links, fill forms, read pages
  • ๐Ÿ“ Manage files โ€” Read, write, copy, move, delete, search, zip/extract
  • ๐Ÿ–ฑ๏ธ Control your desktop โ€” Mouse clicks, keyboard typing, window management
  • โฌ‡๏ธ Download files โ€” Direct URL downloads
  • ๐Ÿ“ฆ Install packages โ€” pip, npm, or choco
  • ๐Ÿ”Š Control audio โ€” Volume up/down/mute/set
  • ๐Ÿ”ง Run commands โ€” Shell execution with injection protection
  • ๐Ÿง  Remember things โ€” Persistent memory across sessions
  • ๐Ÿฆž Run AI pipelines โ€” Start/stop/monitor PURPCLAW build pipelines

Architecture

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”     WebSocket      โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚   Xiaozhi Ball   โ”‚ โ—„โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€ โ”‚  xiaozhi_bridge   โ”‚
โ”‚  (DeepSeek V3.1) โ”‚   MCP Protocol    โ”‚     v7.0.0        โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜                   โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                                                โ”‚
                              โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
                              โ”‚                 โ”‚                 โ”‚
                        โ”Œโ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”   โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”   โ”Œโ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”
                        โ”‚ Playwright โ”‚   โ”‚ PowerShell  โ”‚   โ”‚  Python   โ”‚
                        โ”‚  Browser   โ”‚   โ”‚  .NET APIs  โ”‚   โ”‚  OpenCV   โ”‚
                        โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜   โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜   โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

58 MCP Tools

Vision (5)

Tool Description
screen_capture Screenshot the screen
screen_ocr Read text from screen via OCR
screen_find_object Detect objects with YOLO
screen_find_template Find image on screen (template match)
screen_info Get monitor sizes

Webcam (3)

Tool Description
webcam_look Take a photo with PC webcam
webcam_detect Detect people/faces/objects
webcam_read OCR text visible to camera

Mouse & Keyboard (3)

Tool Description
mouse_click Click, double-click, right-click, drag
mouse_scroll Scroll wheel
keyboard_type Type text or press shortcuts

UI Automation (1)

Tool Description
find_and_click Find and click UI elements by text

Window Management (3)

Tool Description
window_list List open windows
window_focus Focus a window by title
window_close Close a window

File System (8)

Tool Description
file_read Read file contents
file_write Write/append to files
file_list List directory contents
file_search Search by name or content (Node.js, fast)
file_copy Copy files/directories
file_move Move/rename files
file_delete Delete files (system path protected)
dir_create Create directories

Browser (9) โ€” Playwright-Powered

Tool Description
browser_open Open URL with full interaction
browser_click Click links/buttons by visible text
browser_type Type into form fields
browser_scroll Scroll pages up/down
browser_get_content Read page text content
browser_screenshot Screenshot the browser
browser_navigate Navigate: goto/back/forward/reload
browser_tabs List open tabs
browser_close_tab Close a tab

Downloads (1)

Tool Description
download_file Download file from URL

Process Management (2)

Tool Description
process_list List running processes
process_kill Kill process by name/PID (critical procs blocked)

Audio (1)

Tool Description
volume_control Set volume, mute/unmute, up/down

Archives (2)

Tool Description
zip_create Create zip archives
zip_extract Extract zip archives

Package Management (1)

Tool Description
install_package Install via pip/npm/choco

PURPCLAW Pipeline (4)

Tool Description
purpclaw_start Start AI build pipeline
purpclaw_stop Stop pipeline
purpclaw_status Get pipeline status
purpclaw_logs Read pipeline logs

Git (1)

Tool Description
git_command Run git commands

Communication (4)

Tool Description
http_request HTTP GET/POST/PUT/DELETE
clipboard Read/write clipboard
speak Text-to-speech via Kokoro TTS
notification Desktop toast notifications

Context & System (5)

Tool Description
active_window Get focused window info
system_status CPU, RAM, process overview
disk_info Drive space for all disks
network_info IP, WiFi, internet status
execute_command Execute shell commands (injection-proof)

Apps & Tasks (4)

Tool Description
open_application Launch apps by name
memory Persistent remember/recall/forget/list
task_schedule Schedule delayed tasks
task_list List scheduled tasks

Setup

Prerequisites

  • Node.js 18+
  • Python 3.10+ with opencv-python
  • Playwright (npm i playwright)
  • Xiaozhi.me account + configured agent

Install

git clone https://github.com/weemadscotsman/purpclaw.git
cd purpclaw
npm install
cp .env.example .env
# Edit .env with your Xiaozhi MCP token

Run

# Windows (recommended)
start_xiaozhi_bridge.bat

# Or manually
set XIAOZHI_MCP_URL=wss://api.xiaozhi.me/mcp/?token=YOUR_TOKEN
node lib/xiaozhi_bridge.js

Environment Variables

Variable Description
XIAOZHI_MCP_URL WebSocket URL from Xiaozhi MCP settings
OPENCLAW_GATEWAY OpenClaw WebSocket (default: ws://127.0.0.1:18789)
MCP_BRIDGE_URL Bridge HTTP server (default: http://localhost:3001)

Security

  • Shell injection protection โ€” All user input is sanitized via san() function
  • System path protection โ€” Cannot delete Windows/Program Files/AppData
  • Critical process protection โ€” Cannot kill explorer, csrss, lsass, svchost
  • Destructive command blocking โ€” Blocks format, recursive system deletes, shutdown
  • Tool timeout โ€” Every tool has a timeout to prevent hanging

Tech Stack

  • Runtime: Node.js (single-process, async event loop)
  • Browser: Playwright (Chromium, headless: false)
  • Vision: Python OpenCV + Haar cascade face detection
  • Screen: .NET System.Drawing (fast screenshot via PowerShell)
  • Shell: PowerShell -NoProfile -NonInteractive (fast startup)
  • Protocol: MCP over WebSocket (JSON-RPC 2.0)

Built With

Created by Eddie Cannon (@weemadscotsman) โ€” Edinburgh, Scotland ๐Ÿด๓ ง๓ ข๓ ณ๓ ฃ๓ ด๓ ฟ

  • Pixel Dynasty โ€” Gaming Archive & Collab Hub
  • NDK Threads โ€” ndkthreads.com
  • PixelBits420 โ€” Fair-launch cryptocurrency

License

MIT

MCP Server ยท Populars

MCP Server ยท New