mcp-financial-data

An MCP server (spec 2025-11-25) for SEC EDGAR + FRED + Polygon.io withOAuth 2.1, a citation-grounded 10-K extractor powered by ClaudeSonnet 4.5, and an MCP Apps inline UI for the extracted summary.

Anchor audience: Anthropic Forward Deployed Engineering, Bridgewater /Citadel / Anthropic Finance teams.

License

This repo is project P1 of a 9-project portfolio sprint runningMay 18 – June 21, 2026.

Why this exists

Financial-services AI work consistently fails the same audit checklist:

The agent answered with a number, but the citation pointed at the wrongfiling.
The agent dropped a fact silently when the source didn't support it.
The MCP integration didn't validate JWT scopes per request.
The eval harness wasn't deterministic, so the regression yesterday isindistinguishable from a flaky judge today.

Every one of these is a hard constraint in this repo, enforced in CI.

Architecture

flowchart LR
    Client["MCP Client (Claude Desktop / Cursor / Goose)"] -->|"OAuth 2.1 + PKCE"| Server[FastMCP Server]
    Server --> EDGAR[EDGAR async client]
    Server --> FRED[FRED async client]
    Server --> Polygon[Polygon.io async client]
    Server --> Extractor["10-K Extractor (Claude Sonnet 4.5 + Citations)"]
    Server --> Apps["MCP Apps inline UI"]
    EDGAR --> Cache[(Postgres response cache 24h TTL)]
    FRED --> Cache
    Polygon --> Cache
    Extractor --> Cache
    Server --> Evals["evals/harness.py (JSONL runs)"]

Quickstart

git clone https://github.com/SebAustin/mcp-financial-data.git
cd mcp-financial-data
cp .env.example .env   # fill in API keys
make setup             # uv sync + pre-commit install
make ci                # lint + typecheck + tests + smoke eval (offline)
make serve             # run the MCP server on $MCP_HOST:$MCP_PORT

What's in the box

Surface	Tool / endpoint	Notes
MCP tool	`edgar.list_filings`	Recent SEC filings for a CIK.
MCP tool	`edgar.company_facts`	XBRL facts (us-gaap concepts).
MCP tool	`fred.series`	FRED economic time series.
MCP tool	`polygon.aggregates`	OHLCV aggregate bars.
MCP tool	`tenk.extract_section`	Claude-grounded 10-K claims with citations.
MCP App	`tenk-summary-card`	Inline UI rendering the extractor output.
OAuth	RFC 6750 resource server	Validates JWT against external IdP.

Eval targets (W1)

The eval harness writes per-case JSONL plus a summary JSON toevals/runs/<run_id>/. CI runs --smoke --offline on every PR; nightlyCI runs --full against live APIs with a $5 spend cap.

Metric	W1 target	Source of truth	Notes
Smoke pass rate	5 / 5 cases	`evals/cases/seed.jsonl`	Offline fixtures.
Mean exec-accuracy	≥ 0.95	`evals/metrics.py::exec_accuracy`	Deterministic.
Mean citation coverage	= 1.00 (extractor)	`evals/metrics.py::citation_coverage`	Required for `tenk.*`.
Mean judge score (offline)	≥ 0.90	`evals/metrics.py::judge_with_stub`	0.5·exec + 0.5·citation.
P50 latency (smoke)	≤ 50 ms	harness `latency_ms`	Offline only.
Total cost / smoke run	$0.00	harness `total_cost_usd`	`--offline` enforced.
Coverage gate (src/)	≥ 85%	`pytest --cov-fail-under=85`	mypy `--strict` also gates.

Hard constraints (skim before contributing)

Citations are non-optional for the 10-K extractor. EveryCitedClaim.text carries at least one Citation. Uncited model outputis dropped or moved to notes with [INFERENCE]. Seedocs/adr/0003-citations-required-for-extracted-claims.md.
EDGAR Fair Access is enforced. Every request carries theEDGAR_USER_AGENT env value. Process rate limit ≤ 10 req/sec.
Spend cap. MAX_API_SPEND_USD defaults to 50. Enforced in extractorand harness.
No requests, no print(), no subprocess shell=True, no bare except.
OAuth 2.1 RS only. This server validates JWTs; it is never the IdP.See docs/adr/0004-oauth21-as-resource-server.md.

Layout

src/mcp_financial_data/      # the package
  server.py                  # FastMCP entrypoint
  auth/oauth.py              # OAuth 2.1 RS primitives
  tools/{edgar,fred,polygon} # async API clients
  extractors/tenk.py         # citation-grounded 10-K extractor
  apps/ui.py                 # MCP Apps inline UI registration
  evals/{harness,metrics}    # eval harness (--smoke / --full / --offline)
tests/{unit,integration}     # pytest, 85% gate, integration skipped by default
evals/cases/seed.jsonl       # 5 hand-authored eval cases
evals/runs/                  # per-SHA harness output (gitignored)
docs/adr/                    # MADR architecture decisions
.github/                     # CI templates + issue/PR templates + dependabot

References

License

MIT. See LICENSE.

mcp-financial-data

mcp-financial-data

Why this exists

Architecture

Quickstart

What's in the box

Eval targets (W1)

Hard constraints (skim before contributing)

Layout

References

License

MCP Server · Populars

🦞 OpenClaw — Personal AI Assistant

MarkItDown-MCP

MarkItDown

Awesome MCP Servers

mcp-server-sentry: A Sentry MCP server

MCP Server · New

Heym

Open Computer Use

🚀 TestRail MCP Server

Metabase MCP Server

MK QA Master