AgentCursor

Human-like cursor browser automation for coding agents, over MCP.

AgentCursor lets any MCP-capable coding agent (Claude Code, Cursor, …) read thepage you're looking at and drive it with a visible, human-like cursor — onethat's convincing both to a person watching the screen and to behavioral botdetection.

The major agent browser servers (Playwright MCP, browser-use, Stagehand,Skyvern) don't ship human-like cursor movement in their open-source core —stealth is paywalled into cloud tiers. AgentCursor is that missing piece, MITlicensed.

Status: phase 1 (Chrome extension) and phase 2 (macOS OS-cursor for genuinelytrusted events) are both implemented. See docs/DESIGN.md.

How it works

Three layers, with one shared wire contract (src/protocol):

coding agent ──MCP/stdio──▶ MCP server ──localhost WebSocket──▶ Chrome extension ──▶ your real tab
                            (src/server)                        (extension/)
                                 │
                                 ▼
                          human-path engine (src/path-engine)
            from + to → timed cursor samples with overshoot, log-normal
            velocity, jitter, off-center landing, dwell — fresh every call

The MCP server generates the cursor sample stream; the extension is a thinreplayer. The same stream works for the content-script driver, thechrome.debugger stealth driver, and (phase 2) the OS cursor — they allimplement one BrowserDriver interface.

Why human-like movement is hard

Modern detectors (DataDome, Castle, reCAPTCHA v3, PerimeterX) flag overly smoothBézier paths, constant velocity, dead-center clicks, zero dwell, teleportingjumps, and replayed identical paths. The engine addresses each:

Fitts's law sets per-move duration from distance and target size.
Asymmetric, eased velocity — not a symmetric min-jerk bell.
Overshoot-and-correct on long moves.
Sub-pixel Gaussian jitter, zero at the endpoints.
Off-center landing inside the target.
Right-skewed dwell before the press.
Per-call entropy — paths are never cached or replayed.

Realism is necessary but not sufficient: content-script events areisTrusted=false, and chrome.debugger still leaks CDP tells. The real evasionendgame is the phase-2 OS cursor (genuine, trusted OS events).

Install

git clone <your-fork-url> agentcursor
cd agentcursor
npm install
npm run build      # builds dist/index.js + extension/dist/*

1. Load the extension

Open chrome://extensions, enable Developer mode.
Load unpacked → select the extension/ folder.
Keep a normal http(s) tab open and focused (not chrome:// or the WebStore — content scripts can't run there).

2. Connect your agent

Claude Code:

claude mcp add agentcursor -- node /absolute/path/to/agentcursor/dist/index.js

Any MCP client (JSON config):

{
  "mcpServers": {
    "agentcursor": {
      "command": "node",
      "args": ["/absolute/path/to/agentcursor/dist/index.js"]
    }
  }
}

The server hosts the extension WebSocket on ws://127.0.0.1:8930 (override withAGENTCURSOR_WS_PORT). If the port is taken, the server exits with a clearmessage. The extension reconnects automatically.

Tools

Tool	What it does
`read_page`	Interactive elements with `[ref]` handles, roles, rects + visible text.
`move_to`	Human path to a `ref` or `x/y`. No click.
`click`	Human move + click. `button`, `double`, `stealth`.
`type`	Type with human key timing; human-clicks a `ref` to focus first.
`scroll`	Eased scroll by `dy`/`dx`.
`navigate`	Point the active tab at a URL.
`get_url`	Current tab URL.
`wait_for`	Wait for a `ref` or visible `text`.

Any driving action accepts stealth: true to deliver trusted events through thechrome.debugger driver (this shows Chrome's "debugging this browser" banner).

Trusted OS cursor (phase 2, macOS)

Content-script events are isTrusted=false, and chrome.debugger still leaksCDP tells. For genuinely trusted, indistinguishable input, switch to theOS-cursor driver, which moves the real macOS system cursor along the same humanpath:

npm install @nut-tree-fork/nut-js        # optional native dependency
AGENTCURSOR_DRIVER=os node dist/index.js

It still reads the page through the extension (keep a normal tab focused), butevery move/click/scroll becomes a real OS event. Requires the Chrome windowvisible and foregrounded at 100% zoom, and Accessibility permission for yourterminal/Node in System Settings → Privacy & Security. Coordinate mapping formulti-monitor / fractional-scaling setups is still rough.

Measuring realism

Open test-detector/index.html and click the targets by hand, then drive themwith the agent. Each click is scored on straightness, velocity variance, dwell,off-center landing, overshoot, and isTrusted — the same features detectorsuse. Use it to tune the engine.

Development

npm run dev         # run the server with tsx (no build)
npm run typecheck    # tsc --noEmit
npm test             # vitest (path-engine + coord-map unit tests)
npm run build:ext    # rebuild just the extension
npm run smoke        # end-to-end run: real MCP client + server, simulated browser

Credits

The path engine builds on the ghost-cursor lineage (Bézier + Fitts) and themouse-dynamics literature — WindMouse, SapiAgent, BeCAPTCHA-Mouse, and thevendor write-ups from DataDome and Castle on what makes synthetic movementdetectable. See docs/DESIGN.md.

License

MIT

AgentCursor

AgentCursor

How it works

Why human-like movement is hard

Install

1. Load the extension

2. Connect your agent

Tools

Trusted OS cursor (phase 2, macOS)

Measuring realism

Development

Credits

License

MCP Server · Populars

🦞 OpenClaw — Personal AI Assistant

MarkItDown-MCP

MarkItDown

Awesome MCP Servers

mcp-server-sentry: A Sentry MCP server

MCP Server · New

Geniuz

ggui

CocoIndex Code MCP Server

Tiger Linear MCP Server

MCP Gemini CLI