houtini-ai

carousels-mcp

Community houtini-ai
Updated

Carousels MCP - turn a blog post into a LinkedIn, Instagram or TikTok Carousel

carousels-mcp

npm versionMCP RegistryKnown Vulnerabilities

MCP server that turns articles, transcripts, and markdown into LinkedIn carousel PDFs, Instagram PNGs, and Threads PNGs. Content in, slides out. No web UI, no cloud service.

Where this came from

I'd been building LinkedIn carousels manually -- Gemini for the content distillation, SVG generation for the slides, Puppeteer for the PDF conversion, then merging everything together. The whole process is documented in this article. It worked well but it was a lot of steps. So I packaged the pipeline into an MCP server. Now Claude handles the orchestration and I just tell it what I want.

What it does

You've got an article or a YouTube transcript or some markdown notes and you want a carousel. This server handles everything: content extraction, slide planning, SVG generation, PDF/PNG output. Give it a URL and it scrapes the content. Give it just a topic and it tells Claude how to research first using other MCP tools, then call back with the results.

The content extractor is actually quite good at picking the right slide type automatically. Numbers and percentages become stat slides. Ordered lists become step-by-step slides. Strong emphasis becomes takeaway callouts. "Before/after" language triggers comparison slides. The usual content, list, and quote types are there obviously, plus code slides for technical content and a CTA slide at the end.

The five tools

Tool What it does
list_templates Shows available templates, including any custom brand kits you've made
preview_slides Converts content into a structured slide plan you can review and edit before committing
render_slides Takes a slide plan and produces SVGs + PDF/PNG output
create_carousel Full pipeline in one call -- content in, carousel out
create_template Creates a custom brand kit by overriding colours and fonts from a base template

Eleven slide types

cover · content · code · quote · list · cta · stat · comparison · steps · takeaway · dataviz

Multi-platform output

Each platform gets the right dimensions and format. No fiddling with resize tools.

  • LinkedIn -- 1080x1350, merged into a single PDF carousel
  • Instagram -- 1080x1080, individual PNG slides at 2x resolution
  • Threads -- 1080x1350, individual PNGs

Getting started

Claude Desktop config

{
  "mcpServers": {
    "carousels": {
      "command": "npx",
      "args": ["-y", "@houtini/carousels-mcp"],
      "env": {
        "CAROUSEL_TEMPLATES_DIR": "/path/to/your/brand-kits"
      }
    }
  }
}

CAROUSEL_TEMPLATES_DIR is optional. Skip it if you're fine with the built-in templates.

Or install globally:

npm install -g @houtini/carousels-mcp

PDF and PNG output

SVG generation works out of the box. For the actual PDF and PNG files you need Puppeteer:

npm install -g puppeteer

Without it everything still works -- you just get SVG files. The server tells you this in its response, so you'll know. For most workflows you want Puppeteer installed. The SVG-only path is there as a fallback.

Gemini MCP integration

This is where it gets interesting. If you've got Gemini MCP installed alongside carousels-mcp, the two servers work together through Claude's orchestration. The carousel server can't see Gemini's tools directly -- MCP servers are isolated from each other -- but the tool descriptions teach Claude how to compose them.

What that means in practice:

  • Research-first carousels. Give create_carousel just a topic and it returns orchestration instructions. Claude picks up the workflow, calls gemini_deep_research for the content, then calls back with the results. You get a carousel about a topic you haven't written about yet.
  • Custom charts. Use preview_slides to get a slide plan, then ask Gemini's generate_svg to create charts or diagrams for specific slides. Inject the SVGs into the plan's customSvg fields and call render_slides. The chart ends up embedded directly in the slide.
  • Quality review. After rendering, Gemini's analyze_image can review the output PNGs and suggest improvements. Useful for catching text overflow or layout issues before you upload.

You don't need Gemini MCP for the core pipeline to work. Content in, carousel out -- that works standalone. But the research and visual enhancement workflows are quite good when both servers are available.

Workflow examples

URL to carousel -- the most common one. Give it an article URL, get a carousel back.

"Turn this into a LinkedIn carousel: https://example.com/my-post"

Topic to carousel -- when you haven't written the content yet. Claude researches via Gemini, then builds the slides.

"Make me a carousel about breeding pigs for LinkedIn"

YouTube to carousel -- works with Supadata MCP for transcript extraction.

"Turn this YouTube video into a carousel: https://youtube.com/watch?v=..."

Data-driven with charts -- research, analyse, then generate chart SVGs to embed in the slides.

"Research renewable energy trends, analyse the data, and make a carousel with charts"

Preview, tweak, render -- when you want to review the slide plan before committing.

"I want to review the slides before generating"
→ preview_slides returns editable JSON
→ You adjust slides, swap types, reorder
→ render_slides produces the final output

Templates

Six built-in templates. I spent a fair amount of time on these -- proper font pairings, warm backgrounds, committed colour palettes. No cold whites, no Inter, no generic SaaS aesthetic.

Template Description
professional Navy/parchment, Plus Jakarta Sans + Frank Ruhl Libre. The safe default.
bold-dark Dark navy, cyan-to-indigo gradients. Space Grotesk.
minimal-light Lots of whitespace, single teal accent, system fonts.
warm-friendly Cream background, amber-to-red warmth. Nunito + Source Serif 4.
neon-dark Deep charcoal with electric lime accents. The trending LinkedIn aesthetic right now, actually.
data-storyteller Built for stat and dataviz slides. Generous whitespace around the numbers so they breathe.

Custom brand kits

Use create_template to build your own. Pick a base template, override the colours and fonts you care about, and it saves a JSON file to your templates directory. Next time you run list_templates, your brand kit shows up alongside the built-ins.

The server is fairly smart about brand detection. If you have custom templates but you're using the default professional, it mentions your brand kits in the response. Gentle nudge, not a nag.

Engagement patterns

There's some deliberate design baked into the SVG generation. Not decorative stuff -- actual patterns from LinkedIn carousel performance research.

Progress bar. Every slide gets a gradient bar at the bottom showing position in the deck. The Zeigarnik effect means people who see "3 of 8" feel compelled to keep swiping. It's thin and unobtrusive, but it works.

50 words per slide maximum. The content extractor enforces this. More than about three seconds of reading time per slide and people scroll past. If your content runs long, it gets split across more slides rather than cramming text in.

Slide 5 anchor. The extractor deliberately places a stat or quote slide at position 5. Mid-carousel drop-off is a real thing and an attention-grabbing slide at that position helps retention.

Output structure

Carousels save to ~/Documents/carousels/ in date-stamped folders:

~/Documents/carousels/2026-03-14-my-article-title/
  slide-01.svg
  slide-02.svg
  ...
  slide-01.pdf       (LinkedIn)
  carousel.pdf       (merged, ready to upload)
  slide-01.png       (Instagram/Threads)
  slide-02.png

Override with outputDir parameter or the CAROUSEL_OUTPUT_DIR environment variable.

Tool reference

create_carousel

The full pipeline. Three input modes:

  • Content mode: Pass content directly -- runs the pipeline immediately.
  • URL mode: Pass sourceUrl -- the server fetches the page, extracts article content, runs the pipeline. If the page is JS-rendered and extraction fails, it returns fallback instructions to use firecrawl_scrape or web_fetch instead.
  • Topic mode: Pass topic without content or URL -- returns orchestration workflows telling Claude how to research first, then call back with content.
Parameter Type Default Description
content string -- Article text, markdown, or transcript. Optional if sourceUrl or topic is provided.
topic string -- Topic for research-first workflow. Returns orchestration instructions when no content/URL given.
templateName string "professional" Template to use
brandName string -- Brand name in slide footers
slideCount number 8 Target slides (4--12)
sourceUrl string -- URL to fetch article content from. Also used for CTA slide link.
outputDir string ~/Documents/carousels/ Output directory
platform string "linkedin" linkedin, instagram, or threads

preview_slides

Returns a JSON slide plan you can review and edit before rendering.

Parameter Type Default Description
content string required Content to convert
slideCount number 8 Target slides (4--12)
sourceUrl string -- URL for CTA slide

render_slides

Renders a slide plan (from preview_slides or hand-crafted) to SVG + PDF/PNG.

Parameter Type Default Description
slidePlan object required Structured slide plan
templateName string "professional" Template to use
brandName string -- Brand name in footers
outputDir string ~/Documents/carousels/ Output directory
platform string "linkedin" linkedin, instagram, or threads

list_templates

Lists built-in and custom templates. No parameters.

create_template

Creates a custom brand kit from a base template.

Parameter Type Default Description
name string required Template name (kebab-case)
baseTemplate string "professional" Template to extend
colors object -- Colour overrides (background, card, title, body, accent, footer)
typography object -- Font family overrides
layout object -- Layout overrides (margin, padding, radius)
googleFonts string[] -- Google Fonts to import

Environment variables

Variable Purpose Default
CAROUSEL_TEMPLATES_DIR Directory for custom brand kit JSONs none (built-in only)
CAROUSEL_OUTPUT_DIR Base output directory ~/Documents/carousels/

Requirements

  • Node.js 18+
  • Puppeteer (optional, for PDF/PNG output)
  • Gemini MCP (optional, for research workflows and custom chart generation)

Licence

MIT

MCP Server · Populars

MCP Server · New