WhatsApp MCP Server
A single-binary Go MCP server that wraps whatsmeow to expose a personal WhatsApp account to LLMs. Your MCP client (Claude Desktop, Cursor, etc.) launches whatsapp-mcp serve over stdio on demand — no background daemon, no two-process bridge. Messages are cached in local SQLite and only travel to the model when the agent calls a tool.
Unaffiliated. This is an independent open-source project. It is not affiliated with, endorsed by, or otherwise associated with Meta Platforms, Inc., WhatsApp, or whatsmeow. "WhatsApp" is a trademark of Meta Platforms, Inc., used here nominatively to describe interoperability.
This started as a fork of lharries/whatsapp-mcp and has since been rewritten as a single Go binary. What it adds over the original:
- LID resolution — normalises
@lidJIDs to real phone numbers for accurate contact matching. - Sent-message storage — outgoing messages are persisted locally so conversation history stays complete.
- Disappearing-message timers — outgoing messages inherit the group chat's ephemeral timer automatically.
- Targeted history sync — on-demand per-chat backfill via the
request_synctool. - Extended tool surface — 41 tools (see below): reactions, replies, edits, revoke, mark-read, typing, is-on-whatsapp, full group admin, blocklist, polls (create + vote + tally), contact cards, view-once flag, presence, privacy settings, and the profile "About" text.
- Single-instance enforcement — a
flock(2)onstore/.lockprevents twoserveprocesses racing on the same SQLite files.
Setup
Prerequisites
- Go 1.25+ (build-time only; runtime needs just the compiled binary).
- An MCP client that speaks stdio (Claude Desktop, Cursor, etc.).
- FFmpeg (optional) — required only for
send_audio_messagewhen the input is not already.oggOpus. Without it, usesend_fileto send raw audio. - Windows: CGO must be enabled — see docs/windows.md.
Install
git clone https://github.com/Sealjay/mcp-whatsapp.git
cd mcp-whatsapp
make build # writes ./bin/whatsapp-mcp
Pair your phone (first run only)
./bin/whatsapp-mcp login
Scan the QR code with WhatsApp on your phone (Settings → Linked Devices → Link a Device). The pairing persists to ./store/whatsapp.db. Re-run login only when WhatsApp invalidates the session or you want to switch accounts.
Connect your MCP client
whatsapp-mcp serve is an HTTP daemon on 127.0.0.1:8765 (or $WHATSAPP_MCP_ADDR). MCP clients connect to it over HTTP:
// Claude Desktop — ~/Library/Application Support/Claude/claude_desktop_config.json
{
"mcpServers": {
"whatsapp": { "url": "http://127.0.0.1:8765/mcp" }
}
}
// Claude Code — .claude/mcp.json (project) or ~/.claude/mcp.json (user)
{
"mcpServers": {
"whatsapp": { "type": "http", "url": "http://127.0.0.1:8765/mcp" }
}
}
// Cursor — ~/.cursor/mcp.json
{
"mcpServers": {
"whatsapp": { "type": "http", "url": "http://127.0.0.1:8765/mcp" }
}
}
Restart the client. WhatsApp appears as an available integration. Closing and reopening the client reconnects to the daemon — no process spawn, no per-session handshake, no stdin/stdout juggling.
Sending files
send_file and send_audio_message accept a media_path argument pointing at the file to send. By default, the path must live under ./store/uploads/ (resolved relative to your -store directory). On first run, serve creates the directory automatically; drop files you intend to send into it.
To allow a different directory, set WHATSAPP_MCP_MEDIA_ROOT (absolute path) in the MCP client's env block:
{
"mcpServers": {
"whatsapp": {
"command": "{{PATH_TO_REPO}}/bin/whatsapp-mcp",
"args": ["-store", "{{PATH_TO_REPO}}/store", "serve"],
"env": { "WHATSAPP_MCP_MEDIA_ROOT": "/Users/me/whatsapp-outbox" }
}
}
}
Paths outside the allowed root are rejected with a clear error so Claude can ask you to move the file or update the env var. Symlinks inside the root are resolved before the check, so a symlink that points out of the root is also rejected. Do not place secrets inside the allowed root — the allowlist bounds what the tool can read, but anything inside is fair game.
Architecture
One binary, five internal packages:
cmd/whatsapp-mcp/ login / serve / smoke subcommands
internal/client/ whatsmeow client wrapper (send, download, events, history, features)
internal/store/ SQLite cache, LID resolution, query layer
internal/media/ ogg parsing, waveform synthesis, ffmpeg shell-out
internal/mcp/ mark3labs/mcp-go server + tool registrations
Process lifecycle
serve starts when the MCP client needs it and exits when the client disconnects. A flock(2) on store/.lock prevents two instances racing on the same store (WhatsApp would kick one of the two linked-device connections anyway).
The trade-off: events are persisted to SQLite only while serve is running. When the MCP client quits, the WhatsApp connection closes. On the next launch, whatsmeow emits events.HistorySync events that backfill conversations into SQLite, but the recovery window is governed by WhatsApp's server-side retention for multidevice clients — not by this codebase. Messages that arrive during a gap long enough to outlast WhatsApp's retention are not recoverable. For shorter, known gaps, the request_sync tool triggers a per-chat backfill on demand.
Data storage
Everything lives under ./store/ (override with -store DIR):
store/messages.db— local chat/message cache, indexed for search.store/whatsapp.db— whatsmeow's own device/session state.store/.lock— ephemeral advisory lock for single-instanceserve.
Data flow
- The client sends a JSON-RPC
tools/calltoserveover stdio. - The MCP layer dispatches to an internal handler.
- The handler either queries the local SQLite store or calls whatsmeow directly (send, download, reactions, etc.).
- Incoming WhatsApp events are persisted to the store in a background goroutine inside the same process, so query tools always see current state.
Running the daemon
The daemon is designed to run independently of any MCP client. Three supported lifecycle models:
macOS — launchd. Template at docs/launchd/com.sealjay.whatsapp-mcp.plist. Copy to ~/Library/LaunchAgents/, replace {{PATH_TO_REPO}} / {{STORE_DIR}} placeholders, launchctl load. Daemon runs from login onwards.
Linux — systemd user unit. Template at docs/systemd/whatsapp-mcp.service. Copy to ~/.config/systemd/user/, replace placeholders, systemctl --user enable --now whatsapp-mcp.
Claude Code SessionStart hook. For project-scoped lifetimes, drop docs/hooks/setup.sh into your project's .claude/hooks/ and configure settings.json to invoke it. The hook is idempotent — safe to run alongside launchd/systemd.
Manual. ./bin/whatsapp-mcp serve -addr 127.0.0.1:8765 in any terminal. Ctrl-C to stop.
First-time pairing happens in a browser: start the daemon, open http://127.0.0.1:8765/pair, scan the QR with your phone. No terminal required. WhatsApp's multidevice protocol rotates the linked-device session roughly every 20 days; when that happens, the /pair page serves a fresh QR automatically — visit it again and re-pair.
Flags and environment variables for serve:
-addr host:port(envWHATSAPP_MCP_ADDR, default127.0.0.1:8765).-allow-remote(explicit opt-in to bind a non-loopback address).WHATSAPP_MCP_MEDIA_ROOT— allowed root forsend_file/send_audio_messagepaths.WHATSAPP_MCP_DEBUG=1— disable JID/body redaction in logs.
Tools
41 tools, grouped by purpose.
Read / query
| Tool | Purpose |
|---|---|
search_contacts |
Substring search across cached contact names and phone numbers |
list_messages |
Query + filter messages; returns formatted text with context windows |
list_chats |
List chats with last-message preview; sort by activity or name |
get_chat |
Chat metadata by JID |
get_message_context |
Before/after window around a specific message |
download_media |
Download persisted media to a local path |
request_sync |
Ask WhatsApp to backfill history for a chat |
Send
| Tool | Purpose |
|---|---|
send_message |
Send a text message to a phone number or JID |
send_file |
Send image/video/document/raw audio with optional caption; view_once: bool marks image/video/audio submessages as view-once (ignored for documents) |
send_audio_message |
Send a voice note (auto-converts via ffmpeg if not .ogg Opus); supports view_once: bool |
send_poll |
Send a poll with a question and 2+ options; selectable_count controls how many options a voter may pick. Generates the 32-byte MessageSecret required for votes to decrypt |
send_poll_vote |
Cast a vote on a previously-seen poll; options must match option names exactly |
get_poll_results |
Return the tally for a poll we have cached (includes 0-vote options) |
send_contact_card |
Send a contact card; synthesises a vCard 3.0 from name + phone, or pass a raw vcard to skip synthesis |
Message actions
| Tool | Purpose |
|---|---|
mark_read |
Mark specific message IDs as read |
mark_chat_read |
Ack the most recent incoming messages in a chat to clear the unread badge |
send_reaction |
React to a message (empty emoji clears an existing reaction) |
send_reply |
Text reply that quotes a prior message |
edit_message |
Edit a previously-sent message |
delete_message |
Revoke (delete for everyone) a message |
send_typing |
Set per-chat composing / recording presence |
Groups
| Tool | Purpose |
|---|---|
create_group |
Create a group with a name and initial participants |
leave_group |
Leave a group |
list_groups |
List all groups the user is a member of |
get_group_info |
Full group metadata (participants, settings, invite config) |
update_group_participants |
Add / remove / promote / demote participants (action: add|remove|promote|demote) |
set_group_name |
Change the group subject |
set_group_topic |
Change the group description; empty string clears it |
set_group_announce |
Toggle announce-only mode (only admins can send) |
set_group_locked |
Toggle locked mode (only admins can edit group metadata) |
get_group_invite_link |
Get the invite link; reset: true revokes the previous link first |
join_group_with_link |
Join a group via a chat.whatsapp.com URL or bare invite code |
Blocklist
| Tool | Purpose |
|---|---|
get_blocklist |
Return the current blocklist |
block_contact |
Block a contact by phone number or JID |
unblock_contact |
Unblock a contact |
Privacy / presence / status
| Tool | Purpose |
|---|---|
send_presence |
Set own availability (available or unavailable) — distinct from per-chat send_typing |
get_privacy_settings |
Current privacy settings as JSON |
set_privacy_setting |
Change one privacy setting by name + value (strict enum validation; invalid combinations are rejected) |
set_status_message |
Update the profile "About" text; empty string clears it |
Admin
| Tool | Purpose |
|---|---|
is_on_whatsapp |
Batch-check which phone numbers are registered on WhatsApp |
get_status |
Report whether the bridge is connected and which account it's paired as |
Deferred
Intentionally not exposed yet:
subscribe_presence— no persistence layer for presence events, skipped to avoid a dangling tool.- Profile photo setter — upstream whatsmeow doesn't expose a user-level setter.
- Approval-mode participants, communities, newsletters — low-use surface, deferred.
Limitations
- Prompt-injection risk: as with many MCP servers, this one is subject to the lethal trifecta. Prompt injection in incoming messages could lead to private data exfiltration — treat the tool surface accordingly.
- Re-authentication: WhatsApp may invalidate the linked-device session periodically; re-run
./bin/whatsapp-mcp loginwhen that happens. - Message gaps when
serveisn't running: events only flow into SQLite while the binary is alive. Messages sent during an offline window are recovered on next reconnect only if WhatsApp's multidevice retention still holds them; for longer gaps userequest_syncper chat, or accept the loss. - Single instance per store: only one
whatsapp-mcp servecan hold the store lock. Parallel MCP clients must point at different-storedirectories (and therefore different paired sessions). - Windows: requires CGO and a C compiler — see docs/windows.md.
- Upstream bounds: message fetch/send is bounded by what whatsmeow supports against the WhatsApp web multidevice API.
- Log redaction is obfuscation, not anonymisation. Partial knowledge of your contacts allows correlation from
…last4. Symlinks inside./store/uploads/are resolved before the path check so they cannot escape, but the root itself is a trust boundary — only place files you intend to send inside it.
Development
make test # unit tests
make test-race # with -race
make vet # go vet
make e2e # build + JSON-RPC smoke over stdio (requires -tags=e2e)
make smoke # boot-test the server without connecting to WhatsApp
Upgrading whatsmeow
Weekly CI runs an upstream upgrade probe. To do it manually:
make upgrade-check
This bumps go.mau.fi/whatsmeow@main, re-tidies, builds, and tests. If green, commit the go.mod / go.sum changes.
scripts/mdtest-parity.sh in CI fails the build early if upstream removes or renames any whatsmeow method we call — it's the canary for API drift.
Troubleshooting
connect failed …onserve— run./bin/whatsapp-mcp loginfirst.servecannot display a QR because its stdout is reserved for MCP JSON-RPC.another whatsapp-mcp instance is already running— only oneservecan hold the store lock. Check for a stray process (ps aux | grep whatsapp-mcp) or another MCP client pointed at the same-storedirectory.- QR doesn't display — the terminal doesn't render half-block Unicode. Try iTerm2, Windows Terminal, or similar.
- Device limit reached — WhatsApp caps linked devices. Remove one from Settings → Linked Devices on your phone.
- No messages loading — after initial auth, it can take several minutes for history to backfill. Use
request_syncto target a specific chat. - WhatsApp out of sync — delete both database files (
store/messages.dbandstore/whatsapp.db) and re-runlogin. ffmpeg not found—send_audio_messageneeds ffmpeg onPATHto convert non-Opus audio. Usesend_filefor raw audio instead.
Debug logging
By default, JIDs in stderr logs are redacted to …<last-4-chars-of-user-part> and message bodies are summarised as [<length>B: text|url|command]. To see full content while actively debugging:
As a flag:
./bin/whatsapp-mcp -debug serveAs an env var in your MCP client config:
"env": { "WHATSAPP_MCP_DEBUG": "1" }
Turn it back off once you're done — redaction is there so shared log snippets don't leak conversation content.
Honesty disclaimer. The …last4 scheme is obfuscation for log-reader convenience, not anonymisation. Someone with independent knowledge of your contacts can still correlate …4567 with a specific phone number. Treat redacted logs as "probably safe to paste into a GitHub issue", not "anonymised".
For Claude Desktop integration issues, see the MCP documentation.
Contributing
Contributions welcome via pull request. See CONTRIBUTING.md.
Licence
MIT Licence — see LICENSE.