Video URL Analyzer MCP
MCP server to analyze YouTube, TikTok & Instagram videos from URL — transcripts, AI insights, tutorial extraction
What is This?
Video URL Analyzer MCP is a Model Context Protocol (MCP) server that lets Claude (or any MCP-compatible AI) analyze videos from YouTube, TikTok, and Instagram — just paste a URL. Powered by Google's Gemini API with full audio + visual analysis, it extracts transcripts, provides AI-powered insights, and can even extract executable tutorial steps.
30-second demo
Paste a YouTube, TikTok, or Instagram URL into Claude and ask:
Analyze this video. Give me:
1. a concise summary
2. transcript highlights with timestamps
3. important visual details not obvious from the transcript
4. any tools, commands, products, or code shown on screen
Video URL Analyzer MCP turns a video link into structured context: transcript, visual understanding, Q&A, and tutorial extraction.
Features
- YouTube Analysis — Direct analysis via Gemini API (no download needed)
- TikTok & Instagram — Async job pattern with yt-dlp download + Gemini Files API
- Full Audio + Visual — Analyzes both video frames AND audio/speech
- 6 Tools — analyze, transcript, Q&A, watch & analyze, execute tutorials, check jobs
- Bilingual — Supports Arabic and English prompts and responses
- Async Jobs — Background processing prevents Claude Desktop timeout crashes
- Security Hardened — URL allowlist, SSRF protection, command injection prevention, path traversal blocking
- Zero-Config Install: run `uvx video-url-analyzer-mcp` and you are running
Supported Platforms
| Platform | Method | Speed |
|---|---|---|
| YouTube | Direct Gemini analysis — no download needed | Instant |
| TikTok | tikwm.com API (fast) → yt-dlp fallback | ~8s |
| Instagram | Page scrape via curl_cffi (fast) → yt-dlp fallback | ~10s |
YouTube videos are analyzed directly through Gemini's native video understanding — zero download, zero upload, maximum speed.
Quick Start
Option 1: uvx (Recommended)
Requires uv.
Claude Desktop -- add to claude_desktop_config.json:
{
  "mcpServers": {
    "video-analyzer": {
      "command": "uvx",
      "args": ["video-url-analyzer-mcp"],
      "env": {
        "GEMINI_API_KEY": "your_key"
      }
    }
  }
}
Claude Code:
claude mcp add video-analyzer -s user -e GEMINI_API_KEY=your_key -- uvx video-url-analyzer-mcp
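Cursor / VS Code -- add to .cursor/mcp.json or .vscode/mcp.json. The `servers` key below is the VS Code form; Cursor expects the same block under `mcpServers`, as in the Claude Desktop example: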
{
  "servers": {
    "video-analyzer": {
      "command": "uvx",
      "args": ["video-url-analyzer-mcp"],
      "env": { "GEMINI_API_KEY": "your_key" }
    }
  }
}
Windsurf -- add to ~/.codeium/windsurf/mcp_config.json:
{
  "mcpServers": {
    "video-analyzer": {
      "command": "uvx",
      "args": ["video-url-analyzer-mcp"],
      "env": { "GEMINI_API_KEY": "your_key" }
    }
  }
}
Claude Code on Windows
# Install uv if needed
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
# Restart PowerShell, then add the MCP server
claude mcp add video-analyzer -s user -e GEMINI_API_KEY=your_key -- uvx video-url-analyzer-mcp
# Verify
claude mcp list
Option 2: pip install
pip install video-url-analyzer-mcp
Option 3: From source
git clone https://github.com/u2n4/video-url-analyzer-mcp.git
cd video-url-analyzer-mcp
pip install -e .
Tools
| Tool | What it does |
|---|---|
| `analyze_video` | Full audio + visual analysis with custom prompts. Uses Gemini for state-of-the-art multimodal understanding. |
| `get_transcript` | Extract timestamped transcript with speaker identification. Supports 100+ languages via auto-detection. |
| `ask_about_video` | Ask any question, for example "How many people appear?", "What brand is shown at 0:45?", or "Summarize the main argument." |
| `watch_and_analyze` | Extract tutorial steps, shell commands, code snippets, and file paths from technical videos. |
| `execute_tutorial_steps` | Review extracted steps safely, then execute with confirmation. Sandboxed with command & path validation. |
| `check_analysis_job` | Poll background job status for TikTok/Instagram async downloads. |
Safety note: `execute_tutorial_steps` is intended for reviewed, user-approved tutorial steps. Treat commands extracted from videos as untrusted. Prefer review/dry-run first, and do not execute commands from unknown videos without understanding them.
- Default behavior is review-oriented.
- `confirm=true` may execute commands.
- Users must review and understand commands before execution.
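The review-first behavior described above can be sketched as follows. This is a minimal illustration, not the server's actual implementation: `StepResult`, `run_steps`, and the injected `runner` callable are hypothetical names, and the sketch never runs a real process on its own.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class StepResult:
    command: str
    executed: bool
    output: str = ""


def run_steps(
    commands: List[str],
    confirm: bool = False,
    runner: Optional[Callable[[str], str]] = None,
) -> List[StepResult]:
    """Return a review plan by default; only call `runner` when confirm=True.

    `runner` is a hypothetical hook that executes one command and returns
    its output. Requiring it explicitly keeps this sketch side-effect free.
    """
    if not confirm:
        # Review mode: list what would run, execute nothing.
        return [StepResult(command=c, executed=False) for c in commands]
    if runner is None:
        raise ValueError("confirm=True requires an explicit runner")
    return [StepResult(command=c, executed=True, output=runner(c)) for c in commands]
```

Calling `run_steps(commands)` without `confirm` gives the user something to read before anything runs, which matches the intent of the safety note.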
How It Works
YouTube — Synchronous: URL is sent directly to Gemini API for instant analysis (no download).
TikTok & Instagram — Asynchronous: Video is downloaded via yt-dlp, uploaded to Gemini Files API, analyzed, then cleaned up. Returns a job_id immediately — poll with check_analysis_job.
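The asynchronous path can be pictured with a small in-memory job store. The sketch below is an assumption about the general pattern (start work in a thread, return an id, poll for status); the names `submit_job`, `check_job`, and `wait_for` are hypothetical and do not come from the server's code.

```python
import threading
import time
import uuid
from typing import Any, Callable, Dict

# Hypothetical in-memory job store; the real server's storage is not shown in this README.
_jobs: Dict[str, Dict[str, Any]] = {}
_lock = threading.Lock()


def submit_job(work: Callable[..., Any], *args: Any) -> str:
    """Start work(*args) in a background thread and return a job id right away."""
    job_id = uuid.uuid4().hex
    with _lock:
        _jobs[job_id] = {"status": "running", "result": None, "error": None}

    def runner() -> None:
        try:
            update = {"status": "done", "result": work(*args)}
        except Exception as exc:  # failures are reported through the job record
            update = {"status": "failed", "error": str(exc)}
        with _lock:
            _jobs[job_id].update(update)

    threading.Thread(target=runner, daemon=True).start()
    return job_id


def check_job(job_id: str) -> Dict[str, Any]:
    """Return a snapshot of the job record, or an 'unknown' status."""
    with _lock:
        job = _jobs.get(job_id)
        return dict(job) if job else {"status": "unknown"}


def wait_for(job_id: str, timeout: float = 5.0, interval: float = 0.05) -> Dict[str, Any]:
    """Poll until the job leaves the 'running' state or the timeout elapses."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        snapshot = check_job(job_id)
        if snapshot["status"] != "running":
            return snapshot
        time.sleep(interval)
    return check_job(job_id)
```

Returning the id immediately is what keeps the tool call short; the client then polls on its own schedule, as `check_analysis_job` does for real jobs.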
Privacy & Data Flow
This server is designed to be explicit about where video data goes.
| Platform | Data flow |
|---|---|
| YouTube | The video URL is sent to Gemini for native multimodal analysis. The server does not download the YouTube video by default. |
| TikTok | The video may be downloaded temporarily, uploaded to Gemini Files API for analysis, then cleaned up after processing. |
| Instagram | The video may be downloaded temporarily, uploaded to Gemini Files API for analysis, then cleaned up after processing. |
Notes:
- `GEMINI_API_KEY` is required for full multimodal analysis in the current version.
- Browser cookies are disabled by default and only used if `VIDEO_ANALYZER_COOKIES=true`.
- Analysis results may be stored in `ANALYSES_DIR` depending on configuration.
- Do not analyze private, sensitive, or confidential videos unless you are comfortable with this data flow.
Usage Examples
# Full video analysis
analyze_video("https://www.youtube.com/watch?v=dQw4w9WgXcQ")

# Custom analysis prompt
analyze_video("https://www.tiktok.com/@user/video/123",
              prompt="List every product shown and estimate prices")

# Multilingual transcript extraction
get_transcript("https://www.instagram.com/reel/ABC123/", lang="ar")

# Ask specific questions about video content
ask_about_video("https://youtu.be/abc",
                question="What programming language is used in the tutorial?")

# Watch & build: extract tutorial steps
watch_and_analyze("https://www.youtube.com/watch?v=tutorial123")
Try it
Analyze this YouTube video and give me a summary, transcript highlights, and visual details.
Ask this video: what tools, apps, products, or commands appear on screen?
Watch this tutorial and extract the steps, commands, file paths, and warnings. Do not execute anything.
Analyze this video in Arabic, and give me the summary, the key points, and any commands or tools that appear in the walkthrough.
Architecture
| Component | Role |
|---|---|
| Gemini API | Multimodal model — full audio + visual understanding in a single pass |
| FastMCP 3.x | MCP protocol framework over stdio transport |
| yt-dlp + curl_cffi | Video download with Chrome browser impersonation to bypass anti-bot |
| tikwm.com API | TikTok fast-path fallback when yt-dlp is WAF-blocked |
| Background Jobs | Async threading for TikTok/Instagram to prevent Claude Desktop timeouts |
video-url-analyzer-mcp/
├── pyproject.toml # Package metadata & dependencies
├── src/
│ └── video_url_analyzer_mcp/
│ ├── __init__.py # Package init + version
│ ├── __main__.py # python -m support
│ └── server.py # Main MCP server (all 6 tools)
├── .env.example # Environment variable template
├── llms.txt # AI-readable project summary
├── llms-install.md # AI-readable install guide
├── CONTRIBUTING.md
├── CHANGELOG.md
└── LICENSE
Platform Detection
URLs are automatically routed to the correct pipeline:
- YouTube: `youtube.com`, `youtu.be`, `youtube.com/shorts/`
- TikTok: `tiktok.com`, `vm.tiktok.com`, `vt.tiktok.com`
- Instagram: `instagram.com/reels/`, `instagram.com/reel/`, `instagram.com/p/`
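One way to picture this routing is a small function built on `urllib.parse`. The host and path lists come from the list above; the matching rules (exact host match after stripping `www.` or `m.`, http/https only) and the name `detect_platform` are assumptions made for this sketch.

```python
from typing import Optional
from urllib.parse import urlparse

# Hosts and path prefixes as listed in this README; the matching rules are assumed.
YOUTUBE_HOSTS = {"youtube.com", "youtu.be"}
TIKTOK_HOSTS = {"tiktok.com", "vm.tiktok.com", "vt.tiktok.com"}
INSTAGRAM_HOSTS = {"instagram.com"}
INSTAGRAM_PREFIXES = ("/reels/", "/reel/", "/p/")


def detect_platform(url: str) -> Optional[str]:
    """Return 'youtube', 'tiktok', 'instagram', or None for anything else."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https"):
        return None  # rejects file://, ftp://, and scheme-less input
    host = (parsed.hostname or "").lower()
    for prefix in ("www.", "m."):
        if host.startswith(prefix):
            host = host[len(prefix):]
            break
    if host in YOUTUBE_HOSTS:
        return "youtube"
    if host in TIKTOK_HOSTS:
        return "tiktok"
    if host in INSTAGRAM_HOSTS and parsed.path.startswith(INSTAGRAM_PREFIXES):
        return "instagram"
    return None
```

Matching on the parsed hostname rather than on a substring of the URL means a lookalike such as `youtube.com.evil.example` is not routed to the YouTube pipeline.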
Security
The server applies the following protections, one per layer:
| Layer | Protection |
|---|---|
| SSRF | URL allowlist — only YouTube, TikTok, Instagram domains accepted. Private IPs, localhost, file:// blocked. |
| Command Injection | shell=False + shlex.split(). Dangerous command blocklist (rm -rf, reverse shells, eval, pipe-to-shell). |
| Path Traversal | 25+ sensitive path patterns blocked (.ssh, .aws, .env, system dirs, AppData). |
| TLS | Full certificate validation on all downloads. |
| Browser Cookies | Opt-in only via VIDEO_ANALYZER_COOKIES=true. Disabled by default. |
| Download Size | Hard limit of 100 MB per video. |
| DoS Protection | Max 10 concurrent background jobs. Auto-expiry after 1 hour. Storage cap of 200 analyses. |
| Schema Validation | Gemini JSON responses validated before execution. Response size capped at 500K chars. |
| Dependencies | All versions pinned in pyproject.toml. |
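As an illustration of the Command Injection and Path Traversal rows, the sketch below parses a command with `shlex.split()` and rejects it before it could reach `subprocess.run(argv, shell=False)`. The blocklists are deliberately tiny and are assumptions for this example; the server's real lists (including its 25+ path patterns) are not reproduced here, and `validate_command` is a hypothetical name.

```python
import shlex
from typing import List

# Illustrative blocklists only; the server's real lists are not shown in this README.
BLOCKED_PROGRAMS = {"eval", "sh", "bash", "nc", "ncat"}
SHELL_META = {"|", "||", "&&", ";", ">", ">>", "<", "&"}
SENSITIVE_FRAGMENTS = (".ssh", ".aws", ".env")


def _is_recursive_flag(arg: str) -> bool:
    """True for --recursive and for short flag groups containing r or R."""
    if arg == "--recursive":
        return True
    return arg.startswith("-") and not arg.startswith("--") and "r" in arg.lower()


def validate_command(command: str) -> List[str]:
    """Return argv suitable for subprocess.run(argv, shell=False), or raise ValueError."""
    try:
        argv = shlex.split(command)
    except ValueError as exc:  # for example, an unclosed quote
        raise ValueError(f"unparseable command: {exc}") from exc
    if not argv:
        raise ValueError("empty command")
    if argv[0] in BLOCKED_PROGRAMS:
        raise ValueError(f"blocked program: {argv[0]}")
    if argv[0] == "rm" and any(_is_recursive_flag(a) for a in argv[1:]):
        raise ValueError("recursive rm is blocked")
    for arg in argv[1:]:
        if arg in SHELL_META or "$(" in arg or "`" in arg:
            raise ValueError(f"shell syntax is not allowed: {arg!r}")
        if any(fragment in arg for fragment in SENSITIVE_FRAGMENTS):
            raise ValueError(f"sensitive path: {arg!r}")
    return argv
```

Because the result is an argument list that would be run without a shell, a pipe in the original string is just a literal `|` token, and this sketch rejects it outright rather than passing it through.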
Known Limitations
- Requires `GEMINI_API_KEY` for full video understanding in the current version.
- TikTok and Instagram support can be affected by platform anti-bot changes.
- Private, deleted, region-locked, or login-required videos may fail.
- Very long videos may hit provider limits, take longer, or cost more.
- TikTok/Instagram processing is asynchronous; use `check_analysis_job` to poll status.
- Results depend on Gemini model availability, API quota, and rate limits.
- Transcript quality depends on audio clarity, language, captions, and platform metadata.
Configuration
| Variable | Description | Default |
|---|---|---|
| `GEMINI_API_KEY` | Google Gemini API key (required) | (none) |
| `ANALYSES_DIR` | Directory to store analysis results | `./analyses` |
| `VIDEO_ANALYZER_COOKIES` | Enable browser cookies for yt-dlp | `false` |
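A minimal sketch of reading these variables with their defaults might look like the following. The variable names and defaults come from the table above; the `Settings` class, the `load_settings` function, and the rule that only the literal value `true` (case-insensitive) enables cookies are assumptions for illustration.

```python
import os
from dataclasses import dataclass
from pathlib import Path
from typing import Mapping


@dataclass(frozen=True)
class Settings:
    gemini_api_key: str
    analyses_dir: Path
    cookies_enabled: bool


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read the three documented variables, applying the documented defaults."""
    key = env.get("GEMINI_API_KEY", "").strip()
    if not key:
        raise RuntimeError("GEMINI_API_KEY not set")
    cookies = env.get("VIDEO_ANALYZER_COOKIES", "false").strip().lower()
    return Settings(
        gemini_api_key=key,
        analyses_dir=Path(env.get("ANALYSES_DIR", "./analyses")),
        cookies_enabled=(cookies == "true"),  # assumed: anything else stays disabled
    )
```

Passing the environment in as a mapping makes the defaults easy to check without touching the real process environment.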
Tech Stack
| Technology | Purpose |
|---|---|
| google-genai | Google Gemini API SDK |
| FastMCP | MCP protocol framework |
| yt-dlp | Video downloader |
| curl_cffi | Browser impersonation (TLS fingerprint) |
| python-dotenv | Environment variable loading |
Troubleshooting
| Issue | Solution |
|---|---|
| `GEMINI_API_KEY not set` | Create a `.env` file or pass the key via an environment variable |
| TikTok download fails | tikwm.com fallback activates automatically. Ensure curl_cffi is installed. |
| Instagram download fails | `pip install curl_cffi` for browser impersonation support |
| `ENOENT` on Windows | Use `uvx video-url-analyzer-mcp` as the command |
| Claude Desktop timeout | TikTok/Instagram run in background — use check_analysis_job(job_id) to poll |
| Python not found | Install Python 3.10+ from python.org |
No API Key / Client AI Fallback Roadmap
Gemini API provides the best current experience because it can analyze audio and visuals together.
A future fallback mode is planned for users who do not want to provide an API key or when the API is unavailable:
Video URL
→ extract metadata, captions/transcript, and selected keyframes locally
→ return structured context to the MCP client
→ let the user's AI client analyze the prepared context
Planned modes:
| Mode | Status | Description |
|---|---|---|
| API mode | Available now | Uses Gemini for full multimodal video analysis. |
| Client AI fallback | Planned | The MCP server prepares transcript, metadata, and keyframes for the client AI to analyze. |
| Local basic mode | Planned | Returns metadata/transcript/keyframes only, without external model analysis. |
Important: Client AI fallback quality will depend on the MCP client. Some MCP clients may not pass image content from tool results to the model reliably, so transcript/metadata fallback will remain important.
Contributing
See CONTRIBUTING.md for guidelines.
License
MIT — see LICENSE.
Support
If you find this useful, please star this repository!
Made with ❤️ in the Eastern Province of Saudi Arabia.
Arabic
AI Video Analysis Server
An MCP server for video analysis using Google Gemini, Google's latest and most powerful multimodal AI model.
Features
| Tool | Description |
|---|---|
| `analyze_video` | Comprehensive audio and visual analysis with support for custom prompts |
| `get_transcript` | Extracts the spoken text with timestamps; supports 100+ languages |
| `ask_about_video` | Ask any question about the video's content |
| `watch_and_analyze` | Extracts technical tutorial steps, commands, and code |
| `execute_tutorial_steps` | Review and execute the extracted steps safely |
Supported Platforms
| Platform | Speed |
|---|---|
| YouTube | Instant: direct analysis with no download |
| TikTok | ~8 seconds: fast tikwm.com API |
| Instagram | ~10 seconds: direct extraction from the page |
Quick Install
git clone https://github.com/u2n4/video-url-analyzer-mcp.git
cd video-url-analyzer-mcp
pip install -e .
Security
The server is protected against:
- SSRF: allowlist of permitted domains only
- Command injection: dangerous commands blocked, and execution without a shell
- Path traversal: 25+ sensitive paths blocked
- Overload protection: maximum of 10 concurrent jobs
Getting an API Key
- Go to Google AI Studio
- Create a free API key
- Put it in a `.env` file
Privacy and Data Flow
- YouTube: the video URL is sent to Gemini for direct analysis, without downloading the video by default.
- TikTok and Instagram: the video may be downloaded temporarily, then uploaded to Gemini Files API for analysis, after which the temporary files are cleaned up.
- `GEMINI_API_KEY` is currently required for full analysis.
- Do not analyze private or sensitive videos if you are not comfortable with this data flow.
Known Limitations
- A Gemini API key is required for full analysis in the current version.
- TikTok and Instagram may be affected by platform changes or anti-bot protection.
- Private, deleted, or login-required videos may not work.
- Long videos may take longer or hit provider limits.
Roadmap for Working Without an API
The next planned step is a fallback mode in which the server prepares the transcript, the metadata, and selected frames from the video, then leaves the analysis to the AI in the client itself.