emzimmer

Mozilla Readability Parser MCP Server

Community emzimmer
Updated

Fetch just the main content and convert it to markdown. Strips away all the junk, which is cleaner for LLMs and reduces your overall context window.

Mozilla Readability Parser MCP Server

An model context protocol (MCP) server that extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure. More about MCP.

Features

  • Removes ads, navigation, footers and other non-essential content
  • Converts clean HTML into well-formatted Markdown (also uses Turndown)
  • Returns article metadata (title, excerpt, byline, site name)
  • Handles errors gracefully

Why Not Just Fetch?

Unlike simple fetch requests, this server:

  • Extracts only relevant content using Mozilla's Readability algorithm
  • Eliminates noise like ads, popups, and navigation menus
  • Reduces token usage by removing unnecessary HTML/CSS
  • Provides consistent Markdown formatting for better LLM processing
  • Includes useful metadata about the content

Installation

Installing via Smithery

To install Mozilla Readability Parser for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install server-moz-readability --client claude

Manual Installation

npm install server-moz-readability

Tool Reference

parse

Fetches and transforms webpage content into clean Markdown.

Arguments:

{
  "url": {
    "type": "string",
    "description": "The website URL to parse",
    "required": true
  }
}

Returns:

{
  "title": "Article title",
  "content": "Markdown content...",
  "metadata": {
    "excerpt": "Brief summary",
    "byline": "Author information",
    "siteName": "Source website name"
  }
}

Usage with Claude Desktop

Add to your claude_desktop_config.json:

{
  "mcpServers": {
    "readability": {
      "command": "npx",
      "args": ["-y", "server-moz-readability"]
    }
  }
}

Dependencies

  • @mozilla/readability - Content extraction
  • turndown - HTML to Markdown conversion
  • jsdom - DOM parsing
  • axios - HTTP requests

License

MIT

MCP Server ยท Populars

MCP Server ยท New

    Blazemeter

    BlazeMeter MCP Server

    Official BlazeMeter MCP Server for AI-driven performance testing

    Community Blazemeter
    nirholas

    Universal Contract AI Interface

    Universal Contract AI Interface (UCAI) ๐Ÿ”— ABI to MCP | The open standard for connecting AI agents to blockchain. MCP server generator for smart contracts. Claude + Uniswap, Aave, ERC20, NFTs, DeFi. Python CLI, Web3 integration, transaction simulation. Polygon, Arbitrum, Base, Ethereum EVM chains. Claude, GPT, LLM tooling, Solidity, OpenAI.

    Community nirholas
    Dictation354

    Paper Fetch Skill

    Fetch papers as agent-ready markdown โ€” DOI/URL/title in, structured full text out. CLI ยท MCP ยท Skill.

    Community Dictation354
    Kaelio

    ktx

    ktx is the context layer for analytics agents

    Community Kaelio
    Goldentrii

    AgentRecall

    Persistent, correction-driven memory for AI agents. Cross-session, cross-platform (Claude Code, Codex, Gemini โ€” any MCP client). Learns from mistakes, compresses context to save tokens, consolidates knowledge overnight. npm: agent-recall-mcp

    Community Goldentrii