Anandb71

Arbor

Community Anandb71
Updated

The Graph-Native Intelligence Layer for Code.

Arbor

The Graph-Native Intelligence Layer for Code Stop RAG-ing. Start navigating.

Quick Start • Why Arbor? • Features • Architecture • Protocol • Contributing

Why Arbor?

The Vector RAG Problem: Most AI coding assistants treat your codebase like a bag of text. They embed chunks into vectors and hope similarity search finds the right context. The result? Hallucinated connections, missing dependencies, and refactors that break everything downstream.

Arbor thinks differently.

We parse your code into an Abstract Syntax Tree using Tree-sitter, then build a living graph where every function, class, and variable is a node, and every import, call, and implementation is an edge. When an AI asks "where is authentication handled?", Arbor doesn't grep for "auth" — it traces the call graph to find the actual service that initiates the flow.

Traditional RAG:         Arbor:
                         
"auth" → 47 results      "auth" → AuthController
                                  ├── validates via → TokenMiddleware  
                                  ├── queries → UserRepository
                                  └── emits → AuthEvent

Quick Start

Option 1: Download Pre-built Binary (Recommended)

Download arbor-windows-v0.1.0.zip from the Releases page.

# Unzip and add to PATH, then:
cd your-project
arbor init
arbor index
arbor bridge --viz   # Starts server + opens visualizer

Option 2: Build from Source

# Clone and build
git clone https://github.com/Anandb71/arbor.git
cd arbor/crates
cargo build --release

# Build visualizer (requires Flutter)
cd ../visualizer
flutter build windows

That's it. Your IDE or AI agent can now connect to ws://localhost:7433 and query the graph.

Features

🌲 AST-Graph Intelligence

Every code entity becomes a queryable node. Arbor understands scope, shadowing, and namespace isolation — so when you ask for context, you get the exact logical block, not keyword-matched noise.

⚡ Sub-100ms Incremental Sync

Arbor watches your files and re-parses only the changed AST nodes. In a 100k-line monorepo, saving a file triggers a ~15ms update. You'll never notice it running.

🔍 Blast Radius Analysis

Refactoring a function? Arbor traces every caller, every consumer, every downstream dependency. See the full impact before you break production.

📊 Semantic Ranking

Not all code is equal. Arbor ranks nodes by "centrality" — a function called by 50 others is more architecturally significant than a one-off utility. Context windows get the important stuff first.

🎨 Logic Forest Visualizer

The optional desktop app renders your codebase as an interactive force-directed graph. Custom shaders create bloom and glow effects as you navigate. Features include:

  • Follow Mode: Camera automatically tracks the node the AI is focusing on
  • Low GPU Mode: Disable effects for better performance on older hardware
  • Real-time Sync: Graph updates as you edit code

Architecture

┌─────────────────────────────────────────────────────────────────┐
│                         Your IDE / AI Agent                      │
└─────────────────────────────────────────────────────────────────┘
                                │
                                │ WebSocket (Arbor Protocol)
                                ▼
┌─────────────────────────────────────────────────────────────────┐
│                        Context Sidecar                           │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────────────┐   │
│  │   Protocol   │  │   Ranking    │  │      Discovery       │   │
│  │   Handler    │  │   Engine     │  │      Engine          │   │
│  └──────────────┘  └──────────────┘  └──────────────────────┘   │
└─────────────────────────────────────────────────────────────────┘
                                │
                                ▼
┌─────────────────────────────────────────────────────────────────┐
│                         Arbor Graph                              │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────────────┐   │
│  │    Nodes     │  │    Edges     │  │     Relationships    │   │
│  │  (Entities)  │  │   (Links)    │  │    (Semantic)        │   │
│  └──────────────┘  └──────────────┘  └──────────────────────┘   │
└─────────────────────────────────────────────────────────────────┘
                                │
                                ▼
┌─────────────────────────────────────────────────────────────────┐
│                        Pulse Indexer                             │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────────────┐   │
│  │  Tree-sitter │  │    Watcher   │  │    Delta Sync        │   │
│  │    Parser    │  │   (notify)   │  │    Engine            │   │
│  └──────────────┘  └──────────────┘  └──────────────────────┘   │
└─────────────────────────────────────────────────────────────────┘
                                │
                                ▼
┌─────────────────────────────────────────────────────────────────┐
│                        Your Codebase                             │
│                     TypeScript • Rust • Python                   │
└─────────────────────────────────────────────────────────────────┘

The Protocol

The Arbor Protocol is a simple JSON-RPC interface over WebSocket. Here's what your AI agent can ask:

// Find the architectural root for a concept
{
  "method": "discover",
  "params": { "query": "user authentication" }
}

// Get the blast radius for a function
{
  "method": "impact",
  "params": { "node": "UserService.validateToken" }
}

// Retrieve ranked context for a task
{
  "method": "context",
  "params": { 
    "task": "refactor the payment flow",
    "maxTokens": 8000
  }
}

See docs/PROTOCOL.md for the full specification.

Supported Languages

Language Status Parser
TypeScript tree-sitter-typescript
JavaScript tree-sitter-typescript
Rust tree-sitter-rust
Python tree-sitter-python
Go 🚧 Coming soon
Java 🚧 Coming soon

Adding a new language? See our language contribution guide.

Project Structure

arbor/
├── crates/                 # Rust workspace
│   ├── arbor-core/         # AST parsing, Tree-sitter integration
│   ├── arbor-graph/        # Graph schema, relationships, ranking
│   ├── arbor-watcher/      # File watching, incremental sync
│   ├── arbor-server/       # WebSocket server, protocol handler
│   └── arbor-cli/          # Command-line interface
├── visualizer/             # Flutter desktop app
│   ├── lib/
│   │   ├── core/           # Theme, state management
│   │   ├── graph/          # Force-directed layout
│   │   └── shaders/        # GLSL bloom/glow effects
│   └── shaders/            # Raw GLSL files
└── docs/                   # Extended documentation

Performance

We obsess over speed because slow tools don't get used.

Metric Target Actual
Initial index (10k files) < 5s ~2.3s
Incremental update < 100ms ~15ms
Query response < 50ms ~8ms
Memory (100k LOC) < 200MB ~120MB

Benchmarks run on M1 MacBook Pro. Your mileage may vary, but not by much.

Contributing

We love contributors. Whether you're fixing a typo, adding a language parser, or building something entirely new — you're welcome here.

  1. Read CONTRIBUTING.md
  2. Check the good first issues
  3. Join the discussion in GitHub Discussions

Roadmap

  • Phase 1: Core indexer and CLI
  • Phase 2: Logic Forest visualizer ✅
  • Phase 3: VS Code extension ✅
  • Phase 4: Agentic Bridge (MCP) ✅
  • Phase 5: Linux ARM64/AMD64 + macOS ARM64 CI/CD ✅
  • Phase 6: Language server protocol support
  • Phase 7: Go and Java parser support

Security

Arbor is designed with security in mind:

  • No data exfiltration: All indexing happens locally; no code leaves your machine
  • No API keys required: Works entirely offline
  • No telemetry: Zero phone-home behavior
  • Open source: Full source code available for audit

The Unified Nervous System

Arbor v0.1.0 is feature-complete. The entire stack is now synchronized:

     Claude asks about AuthController
           │
           ▼
    ┌─────────────────┐
    │   Arbor Bridge  │  ← MCP Server (stdio)
    │   (arbor-mcp)   │
    └────────┬────────┘
             │ trigger_spotlight()
             ▼
    ┌─────────────────┐
    │   SyncServer    │  ← WebSocket broadcast
    │   (port 8080)   │
    └────────┬────────┘
             │ FocusNode message
     ┌───────┴───────┐
     │               │
     ▼               ▼
┌─────────┐    ┌─────────┐
│ VS Code │    │  Forest │
│ Golden  │    │ Camera  │
│Highlight│    │Animation│
│ #FFD700 │    │ 600ms   │
└─────────┘    └─────────┘

Experience: Ask Claude, "How does auth work?" → Watch your IDE highlight the file → Watch the Visualizer fly to the node.

CLI Commands

Command Description
arbor init Creates .arbor/ config directory
arbor index Full index of the codebase
arbor query <q> Search the graph
arbor serve Start the sidecar server
arbor export Export graph to JSON
arbor status Show index status
arbor viz Launch the Logic Forest visualizer
arbor bridge Start MCP server for AI integration
arbor bridge --viz MCP + Visualizer together
arbor check-health System diagnostics and health check

License

MIT — use it however you want. See LICENSE for details.

Built for developers who think code is more than text.

"The forest is mapped. The AI is walking the path."

⭐ Star us on GitHub

MCP Server · Populars

MCP Server · New

    egebese

    SEO Research MCP

    A free SEO research tool using Model Context Protocol (MCP) powered by Ahrefs data. Get backlink analysis, keyword research, traffic estimation, and more — directly in your AI-powered IDE.

    Community egebese
    amirsina-mandegari

    GitLab MR MCP

    mcp server for gitlab to get reviews on you merge requests

    Community amirsina-mandegari
    txn2

    kubefwd (Kube Forward)

    Bulk port forwarding Kubernetes services for local development.

    Community txn2
    Anandb71

    Arbor

    The Graph-Native Intelligence Layer for Code.

    Community Anandb71
    VetCoders

    MCP Server Semgrep

    MCP Server Semgrep is a [Model Context Protocol](https://modelcontextprotocol.io) compliant server that integrates the powerful Semgrep static analysis tool with AI assistants like Anthropic Claude. It enables advanced code analysis, security vulnerability detection, and code quality improvements directly through a conversational interface.

    Community VetCoders