OmniCoreAgent - Production-Ready AI Agent Framework
A powerful Python framework for building autonomous AI agents that think, reason, and execute complex tasks. Production-ready agents that use tools, manage memory, coordinate workflows, and handle real-world business logic.
TL;DR (30 Seconds)
OmniCoreAgent = Build AI agents that:
- Think and reason (not just chatbots)
- Use tools (APIs, databases, files)
- Remember context (across conversations)
- Orchestrate workflows (multi-agent systems)
- Run in production (monitoring, scaling, reliability)
- Plug & Play (switch memory/event backends at runtime: Redis → MongoDB → PostgreSQL → in-memory)
Perfect for: Developers building AI assistants, automation systems, or multi-agent applications.
Get Started: pip install omnicoreagent → set LLM_API_KEY → build your first agent in 30 seconds.
Table of Contents
Getting Started
- TL;DR (30 Seconds)
- Quick Start (1 Minute)
- What is OmniCoreAgent?
- What Problem Does OmniCoreAgent Solve?
- Glossary
- Architecture Overview
- Installation & Setup
Core Features
- OmniAgent - The Heart of the Framework
- Multi-Tier Memory System
- Event System
- Local Tools System
- Memory Tool Backend
- Background Agents
- Workflow Agents
- Semantic Tool Knowledge Base
- Production Observability
- Universal Model Support
- Built-in MCP Client
Advanced Topics
- Production Examples
- Advanced Features
- Configuration Guide
- Vector Database Setup
- Opik Tracing Setup
Development & Support
- Testing
- Troubleshooting
- Contributing
- Documentation
- Why OmniCoreAgent?
MCP Client (CLI Tool - Backward Compatibility)
- MCP Client CLI Commands
- Transport Types & Authentication
- Prompt Management
- Operation Modes
- Token & Usage Management
What is OmniCoreAgent?
OmniCoreAgent is a powerful, production-ready Python framework for building autonomous AI agents that think, reason, and execute complex tasks. It provides a sophisticated yet intuitive architecture for AI agents that go beyond chatbots: agents that use tools, manage memory, coordinate workflows, and handle real-world business logic.
What Problem Does OmniCoreAgent Solve?
Before OmniCoreAgent
Building production-ready AI agents was challenging:
- ❌ Complex Setup: Requires integrating multiple libraries (LLM, memory, tools, orchestration)
- ❌ No Built-in Memory: Manual memory management across conversations
- ❌ Difficult Orchestration: Complex code to coordinate multiple agents
- ❌ No Production Infrastructure: Missing monitoring, observability, error handling
- ❌ Vendor Lock-in: Hard to switch between AI providers
- ❌ Tool Integration Complexity: Difficult to connect agents to external services
With OmniCoreAgent
Build production-ready agents in minutes:
- ✅ Simple API: OmniAgent(...) and you're done
- ✅ Built-in Memory: Redis, PostgreSQL, MongoDB, SQLite with vector database support; switch at runtime!
- ✅ Plug & Play: Switch memory and event backends at runtime (Redis → MongoDB → PostgreSQL → in-memory)
- ✅ Workflow Orchestration: Sequential, Parallel, and Router agents out of the box
- ✅ Production-Ready: Monitoring, observability, error handling built-in
- ✅ Model Agnostic: Switch between OpenAI, Anthropic, Groq, Ollama, and 100+ models
- ✅ Easy Tool Integration: Connect to MCP servers or register Python functions as tools
Core Philosophy
OmniCoreAgent is designed for production applications, not experiments. It's a complete framework that handles everything from memory management and tool orchestration to background automation and real-time event streaming. Build agents that plan multi-step workflows, use tools to gather information, validate results, and adapt their approach based on outcomes.
Key Differentiators
- Complete Agent Framework: Full framework with built-in infrastructure, not just a library
- Multi-Tier Memory System: In-memory, Redis, PostgreSQL, MySQL, SQLite, MongoDB with vector database support; switch at runtime!
- Real-Time Event System: Event router with in-memory and Redis Streams backends; switch at runtime!
- Local Tools System: Register any Python function as an AI tool with simple decorators
- Background Agents: Autonomous task execution with intelligent scheduling
- Workflow Orchestration: Sequential, Parallel, and Router agents for complex multi-agent systems
- Memory Tool Backend: Persistent agent working memory for long-running tasks
- Production Observability: Opik tracing, metrics, and comprehensive monitoring
- Universal Model Support: Model-agnostic through LiteLLM; use any LLM provider
- Built-in MCP Client: Seamless integration with Model Context Protocol servers; connect to filesystems, databases, APIs, and more
Quick Start (1 Minute)
Prerequisites Checklist
Before you begin, make sure you have:
- Python 3.10+ installed (check with python --version)
- LLM API Key from OpenAI, Anthropic, or Groq
- Terminal/Command prompt ready
Step 1: Install (10 seconds)
# Using uv (recommended)
uv add omnicoreagent
# Or with pip
pip install omnicoreagent
Step 2: Set API Key (10 seconds)
# Create .env file in your project directory
echo "LLM_API_KEY=your_openai_api_key_here" > .env
# Or manually create .env file with:
# LLM_API_KEY=sk-your-actual-api-key-here
Step 3: Create Your First Agent (30 seconds)
Create a file my_first_agent.py:
import asyncio
from omnicoreagent import OmniAgent

async def main():
    # Create your agent
    agent = OmniAgent(
        name="my_agent",
        system_instruction="You are a helpful assistant.",
        model_config={"provider": "openai", "model": "gpt-4o"}
    )
    # Run your agent
    result = await agent.run("Hello, world!")
    print(result['response'])
    # Clean up
    await agent.cleanup()

if __name__ == "__main__":
    asyncio.run(main())
Run it:
python my_first_agent.py
✅ Success! You just built an AI agent with:
- ✅ Session management
- ✅ Memory persistence
- ✅ Event streaming
- ✅ Tool orchestration
- ✅ Error handling
- ✅ Production-ready infrastructure
Common First Errors & Fixes
| Error | Fix |
|---|---|
| Invalid API key | Check .env file: LLM_API_KEY=sk-... (no quotes, no spaces) |
| ModuleNotFoundError: omnicoreagent | Run: pip install omnicoreagent |
| RuntimeError: Event loop is closed | Make sure you use asyncio.run(main()) |
| AttributeError: 'coroutine' object has no attribute 'response' | Make sure you await agent.run() |
Next Steps
- Learn More: See Core Features for advanced capabilities
- See Examples: Check Examples for real-world use cases
- Production Ready: See Production Examples for full setups
Architecture Overview
OmniCoreAgent is built on a modular, production-ready architecture:
OmniCoreAgent Framework
├── Core Agent System
│   ├── OmniAgent (Main Agent Class)
│   ├── ReactAgent (Reasoning Engine)
│   ├── Tool Orchestration
│   └── Session Management
│
├── Memory System (Multi-Backend)
│   ├── InMemoryStore (Fast Development)
│   ├── RedisMemoryStore (Production Persistence)
│   ├── DatabaseMemory (PostgreSQL/MySQL/SQLite)
│   ├── MongoDBMemory (Document Storage)
│   ├── Vector Database Integration (Qdrant/ChromaDB/MongoDB Atlas)
│   └── Memory Management (Episodic, Long-term, Working Memory)
│
├── Event System
│   ├── InMemoryEventStore (Development)
│   ├── RedisStreamEventStore (Production)
│   └── Real-Time Event Streaming
│
├── Tool System
│   ├── Local Tools Registry (Python Functions)
│   ├── MCP Tools Integration (Built-in Client)
│   ├── Semantic Tool Knowledge Base
│   └── Memory Tool Backend (Persistent Working Memory)
│
├── Background Agent System
│   ├── Background Agent Manager
│   ├── Task Registry
│   ├── APScheduler Backend
│   └── Lifecycle Management
│
├── Workflow Agents
│   ├── SequentialAgent (Step-by-step chaining)
│   ├── ParallelAgent (Concurrent execution)
│   └── RouterAgent (Intelligent routing)
│
└── Built-in MCP Client
    ├── Model Context Protocol server support
    ├── Multiple transport protocols (stdio, Streamable_HTTP, SSE)
    └── Seamless tool integration
Glossary (For Non-Technical Readers)
Understanding key terms helps you get the most out of OmniCoreAgent:
AI Agent: A program that can think, make decisions, and use tools (like APIs or databases) to complete tasks autonomously. Unlike simple chatbots, agents can plan multi-step workflows and adapt based on results.
LLM (Large Language Model): The "brain" of your agent. Examples include GPT-4, Claude, Gemini. These models understand language and can reason about tasks.
Tool: A function your agent can call to perform actions. Examples: "get weather", "send email", "query database", "read file". Tools extend what your agent can do beyond just talking.
Memory: How your agent remembers past conversations and context. OmniCoreAgent supports multiple memory backends (in-memory, Redis, databases) for different use cases.
Session: A single conversation with your agent. Sessions help maintain context and separate different user interactions.
MCP (Model Context Protocol): A standard protocol for connecting AI agents to external tools and services. OmniCoreAgent has built-in MCP client support.
Vector Database: A special database that stores information in a way that allows semantic search (finding information by meaning, not exact text). Used for long-term memory.
Background Agent: An agent that runs automatically on a schedule, without human interaction. Perfect for monitoring, periodic tasks, or automation.
Workflow Agent: A system that coordinates multiple agents to work together. Examples: Sequential (one after another), Parallel (at the same time), Router (intelligent routing).
Observability: Tools and metrics that help you monitor, debug, and optimize your agents in production. Includes tracing, logging, and performance metrics.
Core Features
1. OmniAgent - The Heart of the Framework
OmniAgent is the main class that powers everything. It's a sophisticated agent with enterprise-grade capabilities:
Common Workflows
Basic Agent:
agent = OmniAgent(
    name="assistant",
    system_instruction="You are a helpful assistant.",
    model_config={"provider": "openai", "model": "gpt-4o"}
)
result = await agent.run("What is Python?")
Agent with Custom Tools:
from omnicoreagent import OmniAgent, ToolRegistry

tools = ToolRegistry()

@tools.register_tool("get_weather")
def get_weather(city: str) -> str:
    return f"Weather in {city}: Sunny, 25°C"

agent = OmniAgent(
    name="weather_agent",
    system_instruction="You help with weather information.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    local_tools=tools
)
Production Agent with Memory & Events:
from omnicoreagent import OmniAgent, MemoryRouter, EventRouter

agent = OmniAgent(
    name="production_agent",
    system_instruction="You are a production agent.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    memory_router=MemoryRouter("redis"),
    event_router=EventRouter("redis_stream"),
    agent_config={
        "max_steps": 20,
        "memory_tool_backend": "local"
    }
)
Key Methods
from omnicoreagent import OmniAgent

agent = OmniAgent(
    name="my_agent",
    system_instruction="You are a helpful assistant.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    local_tools=tool_registry,                # Your custom tools
    mcp_tools=[...],                          # MCP server connections
    memory_router=MemoryRouter("redis"),
    event_router=EventRouter("redis_stream"),
    agent_config={
        "max_steps": 15,
        "tool_call_timeout": 30,
        "memory_results_limit": 5,
        "enable_tools_knowledge_base": True,
        "memory_tool_backend": "local"
    }
)

# Core Methods
await agent.run(query)                          # Execute agent task (session_id auto-generated)
await agent.run(query, session_id="user_123")   # Execute with a specific session_id for context
await agent.connect_mcp_servers()               # Connect to MCP servers
await agent.list_all_available_tools()          # Get all tools (MCP + local)
await agent.get_session_history(session_id)     # Retrieve conversation history
await agent.clear_session_history(session_id)   # Clear history (session_id optional, clears all if None)
await agent.stream_events(session_id)           # Stream real-time events
await agent.get_events(session_id)              # Get event history
agent.get_memory_store_type()                   # Get current memory backend
agent.get_event_store_type()                    # Get current event backend
agent.switch_event_store("redis_stream")        # Switch event backend at runtime (in_memory or redis_stream)
agent.swith_memory_store("redis")               # Switch memory backend at runtime (redis, mongodb, database, in_memory)
agent.swith_memory_store("mongodb")             # Switch to MongoDB
agent.swith_memory_store("database")            # Switch to PostgreSQL/MySQL/SQLite
agent.swith_memory_store("in_memory")           # Switch to in-memory
await agent.cleanup()                           # Clean up resources
Agent Configuration
agent_config = {
    # Execution Limits
    "max_steps": 15,                       # Maximum reasoning steps
    "tool_call_timeout": 30,               # Tool execution timeout (seconds)
    "request_limit": 0,                    # 0 = unlimited, >0 for limits
    "total_tokens_limit": 0,               # 0 = unlimited, >0 for token cap

    # Memory Configuration
    "memory_config": {
        "mode": "sliding_window",          # or "token_budget"
        "value": 10000                     # Window size or token limit
    },
    "memory_results_limit": 5,             # Memory retrieval limit (1-100)
    "memory_similarity_threshold": 0.5,    # Similarity threshold (0.0-1.0)

    # Tool Knowledge Base
    "enable_tools_knowledge_base": True,   # Semantic tool retrieval
    "tools_results_limit": 10,             # Max tools per query
    "tools_similarity_threshold": 0.1,     # Tool similarity threshold

    # Memory Tool Backend for filesystem persistence storage
    "memory_tool_backend": "local"         # "local", or None
}
2. Multi-Tier Memory System
OmniCoreAgent provides multiple memory backends (in-memory, Redis, PostgreSQL/MySQL/SQLite, MongoDB) with intelligent routing and runtime switching: truly plug and play!
Runtime Switching (Plug & Play)
The Beauty of OmniCoreAgent: Switch memory backends at runtime without restarting or losing data. Start with Redis, switch to MongoDB, then to PostgreSQL, all on the fly!
from omnicoreagent import OmniAgent, MemoryRouter

# Start with Redis
agent = OmniAgent(
    name="my_agent",
    memory_router=MemoryRouter("redis"),
    model_config={"provider": "openai", "model": "gpt-4o"}
)

# Use Redis for a while...
result = await agent.run("Store this information")

# Switch to MongoDB at runtime - no restart needed!
agent.swith_memory_store("mongodb")
result = await agent.run("Now using MongoDB backend")

# Switch to PostgreSQL
agent.swith_memory_store("database")   # Uses DATABASE_URL env var

# Switch to in-memory for testing
agent.swith_memory_store("in_memory")

# Switch back to Redis
agent.swith_memory_store("redis")
Use Cases:
- Development → Production: Start with in_memory, switch to redis when ready
- Migration: Switch from one backend to another without downtime
- Testing: Quickly switch between backends for testing
- Cost Optimization: Use cheaper backends for development, premium for production
- A/B Testing: Test different backends with the same agent
Memory Router
from omnicoreagent import MemoryRouter
# In-Memory (Fast Development)
memory = MemoryRouter("in_memory")
# Redis (Production Persistence)
memory = MemoryRouter("redis") # Uses REDIS_URL env var
# Database (PostgreSQL/MySQL/SQLite)
memory = MemoryRouter("database") # Uses DATABASE_URL env var
# Supports: postgresql://, mysql://, sqlite://
# MongoDB (Document Storage)
memory = MemoryRouter("mongodb") # Uses MONGODB_URI env var
# Runtime Switching - Works with any backend!
memory.swith_memory_store("redis") # Switch to Redis
memory.swith_memory_store("mongodb") # Switch to MongoDB
memory.swith_memory_store("database") # Switch to PostgreSQL/MySQL/SQLite
memory.swith_memory_store("in_memory") # Switch to in-memory
Memory Strategies
# Sliding Window - Keep last N messages
memory.set_memory_config(mode="sliding_window", value=100)
# Token Budget - Keep under token limit
memory.set_memory_config(mode="token_budget", value=5000)
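Tying the two together, here is a minimal sketch (using only the MemoryRouter and OmniAgent APIs shown in this document) of configuring a strategy on the router and handing it to an agent; the agent and instruction names are illustrative:
from omnicoreagent import OmniAgent, MemoryRouter

memory = MemoryRouter("redis")
# Keep roughly the last 100 messages in the context window
memory.set_memory_config(mode="sliding_window", value=100)

agent = OmniAgent(
    name="support_bot",
    system_instruction="You are a support assistant.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    memory_router=memory
)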
Vector Database Integration
Enable semantic search and long-term memory:
# .env
ENABLE_VECTOR_DB=true
OMNI_MEMORY_PROVIDER=qdrant-remote # or chroma-remote, mongodb-remote
QDRANT_HOST=localhost
QDRANT_PORT=6333
EMBEDDING_API_KEY=your_embedding_key # REQUIRED when ENABLE_VECTOR_DB=true
Supported Providers:
Qdrant Remote (Recommended)
# Install and run Qdrant
docker run -p 6333:6333 qdrant/qdrant
# Configure
ENABLE_VECTOR_DB=true
OMNI_MEMORY_PROVIDER=qdrant-remote
QDRANT_HOST=localhost
QDRANT_PORT=6333
ChromaDB Remote
# Install and run ChromaDB server
docker run -p 8000:8000 chromadb/chroma
# Configure
ENABLE_VECTOR_DB=true
OMNI_MEMORY_PROVIDER=chroma-remote
CHROMA_HOST=localhost
CHROMA_PORT=8000
ChromaDB Cloud
ENABLE_VECTOR_DB=true
OMNI_MEMORY_PROVIDER=chroma-cloud
CHROMA_TENANT=your_tenant
CHROMA_DATABASE=your_database
CHROMA_API_KEY=your_api_key
MongoDB Atlas
ENABLE_VECTOR_DB=true
OMNI_MEMORY_PROVIDER=mongodb-remote
MONGODB_URI="your_mongodb_connection_string"
MONGODB_DB_NAME="db name"
Memory Types:
- Episodic Memory: Conversation history with semantic search
- Long-term Memory: Persistent knowledge storage
- Working Memory: Active task state (via Memory Tool Backend)
What You Get:
- Long-term Memory: Persistent storage across sessions
- Episodic Memory: Context-aware conversation history
- Semantic Search: Find relevant information by meaning, not exact text
- Multi-session Context: Remember information across different conversations
- Automatic Summarization: Intelligent memory compression for efficiency
Note: an embedding API key is REQUIRED when ENABLE_VECTOR_DB=true, and the dimensions parameter is mandatory in the embedding configuration.
3. Event System
Real-time event streaming for monitoring and debugging, with runtime switching: plug and play!
Runtime Switching (Plug & Play)
The Beauty of OmniCoreAgent: Switch the event router at runtime. Start with in_memory for development, switch to redis_stream for production, all without restarting!
from omnicoreagent import OmniAgent, EventRouter

# Start with in-memory events (fast for development)
agent = OmniAgent(
    name="my_agent",
    event_router=EventRouter("in_memory"),
    model_config={"provider": "openai", "model": "gpt-4o"}
)

# Use in-memory for development...
result = await agent.run("Test query")

# Switch to Redis Streams at runtime for production persistence!
agent.switch_event_store("redis_stream")
result = await agent.run("Now events are persisted in Redis")

# Switch back to in-memory if needed
agent.switch_event_store("in_memory")
Use Cases:
- Development → Production: Start with in_memory, switch to redis_stream when deploying
- Testing: Quickly switch between backends for testing
- Performance Tuning: Use in-memory for speed, Redis for persistence
- Cost Management: Use in-memory for development, Redis for production
Event Backends
from omnicoreagent import EventRouter
# In-Memory Events (Development - Fast)
events = EventRouter("in_memory")
# Redis Streams (Production - Persistent)
events = EventRouter("redis_stream") # Uses REDIS_URL env var
# Runtime Switching - Works seamlessly!
events.switch_event_store("redis_stream") # Switch to Redis Streams
events.switch_event_store("in_memory") # Switch back to in-memory
Event Types
# Event Types Available:
# - user_message
# - agent_message
# - tool_call_started
# - tool_call_result
# - tool_call_error
# - final_answer
# - agent_thought
# - background_task_started
# - background_task_completed
# - background_task_error
Usage Examples
# Stream events in real-time
async for event in agent.stream_events(session_id):
    print(f"{event.type}: {event.payload}")

# Get event history
history = await agent.get_events(session_id)

# Switch event backend and continue streaming
agent.switch_event_store("redis_stream")
async for event in agent.stream_events(session_id):
    print(f"Persisted: {event.type}")
4. Local Tools System
Register any Python function as an AI tool:
from omnicoreagent import ToolRegistry

tool_registry = ToolRegistry()

@tool_registry.register_tool("calculate_area")
def calculate_area(length: float, width: float) -> str:
    """Calculate the area of a rectangle."""
    area = length * width
    return f"Area: {area} square units"

@tool_registry.register_tool(
    name="analyze_data",
    description="Analyze data and return insights",
    inputSchema={
        "type": "object",
        "properties": {
            "data": {"type": "string", "description": "Data to analyze"}
        },
        "required": ["data"]
    }
)
def analyze_data(data: str) -> str:
    """Analyze data and return insights."""
    return f"Analysis: {len(data)} characters processed"

# Use with OmniAgent
agent = OmniAgent(
    name="my_agent",
    system_instruction="You are a helpful assistant.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    local_tools=tool_registry   # Your custom tools!
)
The agent automatically (see the short usage sketch below):
- ✅ Discovers all registered tools
- ✅ Generates tool schemas
- ✅ Handles tool execution
- ✅ Manages errors gracefully
- ✅ Combines with MCP tools seamlessly
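For example, a natural-language query that matches a registered tool is routed to it automatically (a small usage sketch, assuming the calculate_area registration above):
result = await agent.run("What is the area of a 5 by 3 rectangle?")
# The agent selects calculate_area from the registry, calls it with
# length=5 and width=3, and folds the tool output into its final answer.
print(result["response"])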
5. Memory Tool Backend
Persistent working memory for long-running tasks. When to use: Enable this when your agent needs to save intermediate work, track progress across multiple steps, or resume tasks after interruption.
agent_config = {
    "memory_tool_backend": "local"   # Enable persistent memory
}

# Agent automatically gets access to memory_* tools:
# - memory_view: Inspect memory contents
# - memory_create_update: Create/append/overwrite files
# - memory_str_replace: Replace strings in files
# - memory_insert: Insert text at specific line
# - memory_delete: Delete files
# - memory_rename: Rename/move files
# - memory_clear_all: Clear all memory

# Files stored in ./memories/ directory
# Safe, concurrent access with file locks
# Path traversal protection
Use Cases:
- Long-running workflows: Save progress as agent works through complex tasks
- Resumable tasks: Agent can continue where it left off after interruption
- Progress tracking: Monitor multi-step processes
- Intermediate state storage: Store data between agent reasoning steps
- Multi-step planning: Agent can plan, save plan, execute, and update
Example: A code generation agent can save its plan, write code incrementally, and resume if interrupted.
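A minimal sketch of that pattern, using the OmniAgent API shown earlier (the prompt wording is illustrative; the agent itself decides when to call the memory_* tools):
agent = OmniAgent(
    name="code_agent",
    system_instruction=(
        "You are a coding agent. Save a plan to memory before you start, "
        "and update it after each completed step."
    ),
    model_config={"provider": "openai", "model": "gpt-4o"},
    agent_config={"memory_tool_backend": "local"}
)
result = await agent.run("Refactor the utils module step by step.")
# If the process is interrupted, a later run can call memory_view to
# reload the saved plan from ./memories/ and resume where it left off.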
6. Background Agents
Autonomous agents that run independently:
from omnicoreagent import (
    OmniAgent,
    BackgroundAgentService,
    MemoryRouter,
    EventRouter,
    ToolRegistry
)

# Initialize background service
memory_router = MemoryRouter("redis")
event_router = EventRouter("redis_stream")
bg_service = BackgroundAgentService(memory_router, event_router)
bg_service.start_manager()

# Create tool registry
tool_registry = ToolRegistry()

@tool_registry.register_tool("monitor_system")
def monitor_system() -> str:
    """Monitor system resources."""
    import psutil
    return f"CPU: {psutil.cpu_percent()}%, Memory: {psutil.virtual_memory().percent}%"

# Create background agent
agent_config = {
    "agent_id": "system_monitor",
    "system_instruction": "You are a system monitoring agent.",
    "model_config": {
        "provider": "openai",
        "model": "gpt-4o-mini",
        "temperature": 0.3
    },
    "agent_config": {
        "max_steps": 10,
        "tool_call_timeout": 60
    },
    "interval": 300,   # Run every 5 minutes
    "task_config": {
        "query": "Monitor system resources and alert if CPU > 80%",
        "schedule": "every 5 minutes",
        "interval": 300,
        "max_retries": 2,
        "retry_delay": 30
    },
    "local_tools": tool_registry
}

result = await bg_service.create(agent_config)

# Agent Management
bg_service.start_agent("system_monitor")
bg_service.pause_agent("system_monitor")
bg_service.resume_agent("system_monitor")
bg_service.stop_agent("system_monitor")
bg_service.remove_task("system_monitor")

# Status Monitoring
status = bg_service.get_agent_status("system_monitor")
agents = bg_service.list()   # List all agents
Features:
- ✅ Flexible scheduling (interval-based, cron support planned)
- ✅ Lifecycle management (create, pause, resume, delete)
- ✅ Health monitoring
- ✅ Automatic retries
- ✅ Event broadcasting
- ✅ Full OmniAgent capabilities
7. Workflow Agents
Orchestrate multiple agents for complex tasks:
SequentialAgent
Chain agents step-by-step:
from omnicoreagent import OmniAgent, SequentialAgent

# Create agents
agent1 = OmniAgent(name="analyzer", ...)
agent2 = OmniAgent(name="processor", ...)
agent3 = OmniAgent(name="reporter", ...)

# Chain them
seq_agent = SequentialAgent(sub_agents=[agent1, agent2, agent3])
await seq_agent.initialize()
result = await seq_agent.run(
    initial_task="Analyze this data and generate a report"
)
# Output from agent1 → input to agent2 → input to agent3
ParallelAgent
Run agents concurrently:
from omnicoreagent import ParallelAgent

par_agent = ParallelAgent(sub_agents=[agent1, agent2, agent3])
await par_agent.initialize()
results = await par_agent.run(
    agent_tasks={
        "analyzer": "Analyze the data",
        "processor": "Process the results",
        "reporter": None   # Uses system instruction
    }
)
# All agents run simultaneously
RouterAgent
Intelligent task routing:
from omnicoreagent import RouterAgent

router = RouterAgent(
    sub_agents=[code_agent, data_agent, research_agent],
    model_config={"provider": "openai", "model": "gpt-4o"},
    agent_config={"max_steps": 10},
    memory_router=MemoryRouter("redis"),
    event_router=EventRouter("redis_stream")
)
await router.initialize()
result = await router.run(task="Find and summarize recent AI research")
# RouterAgent analyzes the task and routes it to the best agent
8. Semantic Tool Knowledge Base
Automatically discover and retrieve relevant tools using semantic search. The Problem: When you have hundreds of tools, manually selecting which ones to use is impossible. The Solution: OmniCoreAgent automatically finds the right tools based on what the agent needs to do.
Before (Manual Tool Selection):
# ❌ You have to manually specify which tools to use
# ❌ Doesn't scale with 100+ tools
# ❌ Agent might miss relevant tools
tools = [tool1, tool2, tool3]   # Limited selection
After (Semantic Tool Knowledge Base):
# ✅ Agent automatically finds relevant tools
# ✅ Scales to unlimited tools
# ✅ Context-aware selection
agent_config = {
    "enable_tools_knowledge_base": True,
    "tools_results_limit": 10,
    "tools_similarity_threshold": 0.1
}

# All MCP tools are automatically embedded into the vector DB
# The agent uses semantic search to find relevant tools
# Falls back to keyword (BM25) search if needed
How It Works:
- All tools are automatically embedded into a vector database
- When agent needs tools, it searches semantically (by meaning)
- Returns most relevant tools for the current task
- Falls back to keyword search if semantic search finds nothing
Benefits:
- ✅ Scales to unlimited tools: Works with 10 or 10,000 tools
- ✅ Context-aware tool selection: Finds tools based on task meaning
- ✅ No manual registry management: Automatic tool discovery
- ✅ Automatic tool indexing: New tools are automatically available
9. Production Observability
Opik Tracing & Observability Setup
Monitor and optimize your AI agents with production-grade observability:
Quick Setup:
Sign up for Opik (Free & Open Source):
- Visit: https://www.comet.com/signup?from=llm
- Create your account and get your API key and workspace name
Add to your .env file:
OPIK_API_KEY=your_opik_api_key_here
OPIK_WORKSPACE=your_opik_workspace_name
What You Get Automatically:
Once configured, OmniCoreAgent automatically tracks:
- LLM Call Performance: Execution time, token usage, response quality
- Tool Execution Traces: Which tools were used and how long they took
- Memory Operations: Vector DB queries, memory retrieval performance
- Agent Workflow: Complete trace of multi-step agent reasoning
- System Bottlenecks: Identify exactly where time is spent
Benefits:
- Performance Optimization: See which LLM calls or tools are slow
- Cost Monitoring: Track token usage and API costs
- Debugging: Understand agent decision-making processes
- Production Monitoring: Real-time observability for deployed agents
- Zero Code Changes: Works automatically with existing agents
Example: What You'll See
Agent Execution Trace:
├── agent_execution: 4.6s
│   ├── tools_registry_retrieval: 0.02s ✅
│   ├── memory_retrieval_step: 0.08s ✅
│   ├── llm_call: 4.5s ⚠️ (bottleneck identified!)
│   ├── response_parsing: 0.01s ✅
│   └── action_execution: 0.03s ✅
Pro Tip: Opik is completely optional. If you don't set the credentials, OmniCoreAgent works normally without tracing.
Metrics & Monitoring
OmniCoreAgent provides built-in metrics collection for production monitoring:
# Metrics are automatically collected when using production examples
# See examples/devops_copilot_agent/ for Prometheus integration
# See examples/deep_code_agent/ for comprehensive monitoring
# Key Metrics Tracked:
# - Request count and latency
# - Token usage per request
# - Tool execution time
# - Memory operations
# - Error rates
# - Agent step counts
Production Examples Include:
- Prometheus-compatible endpoints: a /metrics endpoint for scraping
- Real-time performance tracking: Monitor agent performance live
- Custom metrics: Add your own business metrics
- Alerting: Set up alerts based on metrics
See Production Examples for complete implementations.
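If you are not wiring up the full production examples, you can expose similar metrics yourself. A minimal sketch with FastAPI and prometheus_client (the metric names and endpoints here are illustrative, not an OmniCoreAgent API; agent is assumed to be an OmniAgent built as shown earlier):
from fastapi import FastAPI, Response
from prometheus_client import Counter, Histogram, generate_latest, CONTENT_TYPE_LATEST

app = FastAPI()
REQUESTS = Counter("agent_requests_total", "Total agent requests")
LATENCY = Histogram("agent_request_seconds", "Agent request latency")

@app.post("/run")
async def run_agent(query: str):
    REQUESTS.inc()                        # Count every request
    with LATENCY.time():                  # Record wall-clock latency
        result = await agent.run(query)   # 'agent' is an OmniAgent built as shown earlier
    return {"response": result["response"]}

@app.get("/metrics")
def metrics():
    # Prometheus scrapes this endpoint
    return Response(generate_latest(), media_type=CONTENT_TYPE_LATEST)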
Production Examples
1. DevOps Copilot Agent
A production-ready DevOps assistant with:
- ✅ Safe bash command execution
- ✅ Rate limiting
- ✅ Audit logging
- ✅ Prometheus metrics
- ✅ Health checks
- ✅ Redis persistence
Location: examples/devops_copilot_agent/
How to Run:
cd examples/devops_copilot_agent
# Follow the README in that directory for setup instructions
Prerequisites:
- Redis running (for persistence)
- LLM API key configured
- Docker (for deployment)
Features:
- Configuration management
- Observability (metrics, logging, audit)
- Security (rate limiting, command filtering)
- Health checks
- Docker deployment
2. Deep Code Agent
An advanced coding agent with:
- ✅ Sandbox code execution
- ✅ Memory tool backend
- ✅ Session workspaces
- ✅ Code analysis tools
- ✅ Test execution
- ✅ Production observability
Location: examples/deep_code_agent/
How to Run:
cd examples/deep_code_agent
# Follow the README in that directory for setup instructions
Prerequisites:
- LLM API key configured
- Optional: Vector database for enhanced memory
Features:
- Secure sandbox execution
- Persistent working memory
- Code generation and testing
- Full observability stack
10. Universal Model Support
OmniCoreAgent is model-agnostic by design through LiteLLM integration. Use OpenAI, Anthropic Claude, Google Gemini, Ollama, or any LLM provider. Switch models without changing agent code. Your logic stays consistent regardless of the underlying model.
Multi-Provider LLM Support
OmniCoreAgent uses LiteLLM for unified access to 100+ models:
# OpenAI
model_config = {"provider": "openai", "model": "gpt-4o"}

# Anthropic
model_config = {"provider": "anthropic", "model": "claude-3-5-sonnet-20241022"}

# Groq (Ultra-fast)
model_config = {"provider": "groq", "model": "llama-3.1-8b-instant"}

# Azure OpenAI
model_config = {
    "provider": "azureopenai",
    "model": "gpt-4",
    "azure_endpoint": "https://your-resource.openai.azure.com"
}

# Ollama (Local)
model_config = {
    "provider": "ollama",
    "model": "llama3.1:8b",
    "ollama_host": "http://localhost:11434"
}

# OpenRouter (200+ models)
model_config = {"provider": "openrouter", "model": "anthropic/claude-3.5-sonnet"}
Supported Providers:
- OpenAI (GPT-4, GPT-3.5, etc.)
- Anthropic (Claude 3.5 Sonnet, Claude 3 Haiku, etc.)
- Google (Gemini Pro, Gemini Flash)
- Groq (Llama, Mixtral, Gemma)
- DeepSeek (DeepSeek-V3, DeepSeek-Coder)
- Mistral
- Azure OpenAI
- OpenRouter (200+ models)
- Ollama (local models)
Why This Matters:
- Cost Optimization: Use cheaper models for simple tasks, powerful models for complex reasoning
- Flexibility: Switch providers without code changes
- Consistency: Same agent logic works across all providers
- Future-Proof: New models automatically supported through LiteLLM
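As a concrete sketch of the cost-optimization point, the same agent code can be instantiated with different models from the provider list above; how you route between them is up to you:
from omnicoreagent import OmniAgent

# Cheap, fast model for simple lookups
triage_agent = OmniAgent(
    name="triage",
    system_instruction="Answer simple factual questions briefly.",
    model_config={"provider": "groq", "model": "llama-3.1-8b-instant"}
)

# Stronger model reserved for complex reasoning
expert_agent = OmniAgent(
    name="expert",
    system_instruction="Handle multi-step analysis and planning.",
    model_config={"provider": "anthropic", "model": "claude-3-5-sonnet-20241022"}
)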
Embedding Support
OmniCoreAgent supports multiple embedding providers for vector database operations:
# .env
EMBEDDING_API_KEY=your_key # Works with OpenAI, Cohere, HuggingFace, Mistral, Voyage, etc.
Supported Embedding Providers:
- OpenAI: text-embedding-3-small, text-embedding-3-large, text-embedding-ada-002
- Cohere: embed-english-v3.0, embed-multilingual-v3.0
- HuggingFace: Various models via the HuggingFace API
- Mistral: mistral-embed
- Voyage: voyage-large-2, voyage-code-2
- Azure OpenAI: Azure-hosted OpenAI embeddings
- Google: Vertex AI embeddings
Embedding Configuration:
embedding_config = {
    "provider": "openai",                # Provider name
    "model": "text-embedding-3-small",   # Model name
    "dimensions": 1536,                  # REQUIRED: Vector dimensions
    "encoding_format": "float",          # Encoding format
    "timeout": 30                        # Optional timeout
}
Important: When ENABLE_VECTOR_DB=true, embedding configuration is REQUIRED. The dimensions parameter is mandatory for vector database index creation.
11. Built-in MCP Client
OmniCoreAgent includes built-in support for Model Context Protocol (MCP) servers, allowing your agents to seamlessly connect to external tools and services without additional setup. This is a core feature that enables your agents to interact with filesystems, databases, APIs, and any MCP-compatible service.
Quick Integration
agent = OmniAgent(
    name="my_agent",
    system_instruction="You are a helpful assistant.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    mcp_tools=[
        # stdio Transport - Local MCP servers
        {
            "name": "filesystem",
            "transport_type": "stdio",
            "command": "npx",
            "args": ["-y", "@modelcontextprotocol/server-filesystem", "/home"]
        },
        # HTTP Transport - Remote MCP servers
        {
            "name": "github",
            "transport_type": "streamable_http",
            "url": "http://localhost:8080/mcp",
            "headers": {"Authorization": "Bearer your-token"}
        }
    ]
)

await agent.connect_mcp_servers()                # Connect to all servers
tools = await agent.list_all_available_tools()   # Get all tools (MCP + local)
Supported Transports
- stdio: Direct process communication (local MCP servers)
- sse: Server-Sent Events (HTTP-based streaming)
- streamable_http: HTTP with streaming support
- docker: Container-based servers
- npx: NPX package execution
Authentication Methods
- OAuth 2.0: Automatic callback server (starts on localhost:3000)
- Bearer Tokens: Simple token-based authentication
- Custom Headers: Flexible header configuration
Key Benefits
- ✅ No Additional Setup: Built directly into OmniAgent
- ✅ Seamless Integration: MCP tools work alongside local tools
- ✅ Multiple Transports: Support for stdio, HTTP, SSE, Docker, NPX
- ✅ Automatic Discovery: All MCP tools automatically available to the agent
- ✅ Unified Tool Access: Use list_all_available_tools() to see MCP + local tools
- ✅ Production Ready: Handles connection errors, retries, and cleanup
Example: Agent with Filesystem Access
agent = OmniAgent(
    name="file_agent",
    system_instruction="You can read and write files.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    mcp_tools=[{
        "name": "filesystem",
        "transport_type": "stdio",
        "command": "npx",
        "args": ["-y", "@modelcontextprotocol/server-filesystem", "/home/user"]
    }]
)

await agent.connect_mcp_servers()
result = await agent.run("List all Python files in the current directory")
# The agent can now use filesystem tools from the MCP server
Example: Agent with Multiple MCP Servers
agent = OmniAgent(
    name="multi_tool_agent",
    system_instruction="You have access to filesystem and GitHub.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    mcp_tools=[
        {
            "name": "filesystem",
            "transport_type": "stdio",
            "command": "npx",
            "args": ["-y", "@modelcontextprotocol/server-filesystem", "/home"]
        },
        {
            "name": "github",
            "transport_type": "streamable_http",
            "url": "http://localhost:8080/mcp",
            "headers": {"Authorization": "Bearer github-token"}
        }
    ]
)

await agent.connect_mcp_servers()
# The agent now has access to both filesystem and GitHub tools
Note: For standalone MCP client usage (CLI tool), see the MCP Client CLI section at the end of this document.
Advanced Features
Advanced MCP Server Management
For dynamic server management and advanced MCP client features, see the MCP Client CLI section below. The built-in MCP client in OmniAgent (documented in Core Features - Built-in MCP Client) handles most use cases, while the CLI provides additional capabilities for:
- Dynamic server addition/removal at runtime
- Interactive MCP server management
- Advanced prompt and resource management
- Standalone MCP client usage
Dynamic Server Configuration (CLI)
Add New Servers
# Add one or more servers from a configuration file
/add_servers:path/to/config.json
The configuration file can include multiple servers with different authentication methods:
{
  "new-server": {
    "transport_type": "streamable_http",
    "auth": {
      "method": "oauth"
    },
    "url": "http://localhost:8000/mcp"
  },
  "another-server": {
    "transport_type": "sse",
    "headers": {
      "Authorization": "Bearer token"
    },
    "url": "http://localhost:3000/sse"
  }
}
Remove Servers
# Remove a server by its name
/remove_server:server_name
Programmatic Configuration
# Add servers at runtime
await client.add_servers("path/to/config.json")
# Remove servers
await client.remove_server("server_name")
Installation & Setup
Requirements
- Python 3.10+
- LLM API key (OpenAI, Anthropic, Groq, etc.)
Optional Dependencies (For Advanced Features)
- Redis: For persistent memory and events (memory_store_type="redis", event_store_type="redis_stream")
- PostgreSQL/MySQL: For database memory (memory_store_type="database")
- MongoDB: For document storage (memory_store_type="mongodb")
- Qdrant/ChromaDB: For vector database (semantic search and long-term memory)
- Opik: For tracing and observability (production monitoring)
Installation
# Using uv (recommended)
uv add omnicoreagent
# Or with pip
pip install omnicoreagent
Environment Variables
# ===============================================
# REQUIRED: AI Model API Key (Choose one provider)
# ===============================================
LLM_API_KEY=your_openai_api_key_here
# OR for other providers:
# LLM_API_KEY=your_anthropic_api_key_here
# LLM_API_KEY=your_groq_api_key_here
# LLM_API_KEY=your_azure_openai_api_key_here
# ===============================================
# OPTIONAL: Embeddings (For Vector Database)
# ===============================================
# REQUIRED when ENABLE_VECTOR_DB=true
EMBEDDING_API_KEY=your_embedding_api_key_here
# Works with OpenAI, Cohere, HuggingFace, Mistral, Voyage, etc.
# ===============================================
# OPTIONAL: Vector Database (Smart Memory)
# ===============================================
# ⚠️ Warning: 30-60s startup time for sentence transformer
# ⚠️ IMPORTANT: You MUST choose a provider - no local fallback
ENABLE_VECTOR_DB=true # Default: false
# Choose ONE provider (required if ENABLE_VECTOR_DB=true):
# Option 1: Qdrant Remote (RECOMMENDED)
OMNI_MEMORY_PROVIDER=qdrant-remote
QDRANT_HOST=localhost
QDRANT_PORT=6333
# Option 2: ChromaDB Remote
# OMNI_MEMORY_PROVIDER=chroma-remote
# CHROMA_HOST=localhost
# CHROMA_PORT=8000
# Option 3: ChromaDB Cloud
# OMNI_MEMORY_PROVIDER=chroma-cloud
# CHROMA_TENANT=your_tenant
# CHROMA_DATABASE=your_database
# CHROMA_API_KEY=your_api_key
# Option 4: MongoDB Atlas
# OMNI_MEMORY_PROVIDER=mongodb-remote
# MONGODB_URI="your_mongodb_connection_string"
# MONGODB_DB_NAME="db name"
# ===============================================
# OPTIONAL: Persistent Memory Storage
# ===============================================
# Redis - for memory_store_type="redis" (defaults to: redis://localhost:6379/0)
# REDIS_URL=redis://your-remote-redis:6379/0
# REDIS_URL=redis://:password@localhost:6379/0 # With password
# Database - for memory_store_type="database"
# DATABASE_URL=sqlite:///omnicoreagent_memory.db
# DATABASE_URL=postgresql://user:password@localhost:5432/omnicoreagent
# DATABASE_URL=mysql://user:password@localhost:3306/omnicoreagent
# MongoDB - for memory_store_type="mongodb" (defaults to: mongodb://localhost:27017/omnicoreagent)
# MONGODB_URI="your_mongodb_connection_string"
# MONGODB_DB_NAME="db name"
# ===============================================
# OPTIONAL: Tracing & Observability
# ===============================================
# For advanced monitoring and performance optimization
# π Sign up: https://www.comet.com/signup?from=llm
OPIK_API_KEY=your_opik_api_key_here
OPIK_WORKSPACE=your_opik_workspace_name
Quick Start: Just set LLM_API_KEY and you're ready to go! Add other variables only when you need advanced features.
Examples
Basic Agent
import asyncio
from omnicoreagent import OmniAgent

async def main():
    agent = OmniAgent(
        name="assistant",
        system_instruction="You are a helpful assistant.",
        model_config={"provider": "openai", "model": "gpt-4o"}
    )

    # Run without session_id (auto-generated)
    result = await agent.run("What is the capital of France?")
    print(result["response"])
    print(f"Session ID: {result['session_id']}")

    # Run with the same session_id for context continuity
    result2 = await agent.run("What is its population?", session_id=result["session_id"])
    print(result2["response"])

    await agent.cleanup()

asyncio.run(main())
Note: session_id is optional. If omitted, a new session is created automatically. Use the same session_id across multiple calls to maintain conversation context.
Agent with Custom Tools
from omnicoreagent import OmniAgent, ToolRegistry

tool_registry = ToolRegistry()

@tool_registry.register_tool("get_weather")
def get_weather(city: str) -> str:
    """Get weather for a city."""
    return f"Weather in {city}: Sunny, 25°C"

agent = OmniAgent(
    name="weather_agent",
    system_instruction="You help users with weather information.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    local_tools=tool_registry
)
result = await agent.run("What's the weather in Tokyo?")
Agent with MCP Servers
agent = OmniAgent(
    name="mcp_agent",
    system_instruction="You have access to filesystem and GitHub tools.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    mcp_tools=[
        {
            "name": "filesystem",
            "transport_type": "stdio",
            "command": "npx",
            "args": ["-y", "@modelcontextprotocol/server-filesystem", "/home"]
        }
    ]
)

await agent.connect_mcp_servers()
result = await agent.run("List files in /home directory")
Production Agent with All Features
from omnicoreagent import (
    OmniAgent,
    ToolRegistry,
    MemoryRouter,
    EventRouter
)

# Memory & Events
memory = MemoryRouter("redis")
events = EventRouter("redis_stream")

# Custom Tools
tools = ToolRegistry()

@tools.register_tool("analyze")
def analyze(data: str) -> str:
    return f"Analyzed: {len(data)} chars"

# Agent
agent = OmniAgent(
    name="production_agent",
    system_instruction="You are a production agent.",
    model_config={"provider": "openai", "model": "gpt-4o"},
    local_tools=tools,
    mcp_tools=[...],
    memory_router=memory,
    event_router=events,
    agent_config={
        "max_steps": 20,
        "enable_tools_knowledge_base": True,
        "memory_tool_backend": "local",
        "memory_results_limit": 10
    }
)

await agent.connect_mcp_servers()
result = await agent.run("Process this data", session_id="user_123")
Learning Resources
Example Projects
Basic Examples
# Simple introduction
python examples/cli/basic.py
# Complete OmniAgent demo - All features showcase
python examples/cli/run_omni_agent.py
# Advanced MCP CLI
python examples/cli/run_mcp.py
Custom Agents
# Background agents
python examples/background_agent_example.py
# Custom agent implementations
python examples/custom_agents/e_commerce_personal_shopper_agent.py
python examples/custom_agents/flightBooking_agent.py
python examples/custom_agents/real_time_customer_support_agent.py
Workflow Agents
# Sequential agent chaining
python examples/workflow_agents/sequential_agent.py
# Parallel agent execution
python examples/workflow_agents/parallel_agent.py
# Router agent (intelligent routing)
python examples/workflow_agents/router_agent.py
Web Applications
# FastAPI integration
python examples/fast_api_iml.py
# Full web interface
python examples/enhanced_web_server.py
# Open http://localhost:8000
Production Examples
DevOps Copilot Agent: examples/devops_copilot_agent/ - a production-ready DevOps assistant
- Safe bash command execution
- Rate limiting, audit logging, Prometheus metrics
- Health checks, Redis persistence
Deep Code Agent: examples/deep_code_agent/ - an advanced coding agent
- Sandbox code execution
- Memory tool backend
- Session workspaces
- Code analysis and test execution
Documentation
- Getting Started: See examples above
- API Reference: Check source code docstrings
- Architecture: See docs/advanced/architecture.md
- Complete Documentation: OmniCoreAgent Docs
Build Documentation Locally
# Install documentation dependencies
pip install mkdocs mkdocs-material
# Serve documentation locally
mkdocs serve
# Open http://127.0.0.1:8000
# Build static documentation
mkdocs build
Configuration Reference
Agent Configuration
agent_config = {
    # Execution
    "agent_name": "my_agent",
    "max_steps": 15,
    "tool_call_timeout": 30,
    "request_limit": 0,        # 0 = unlimited
    "total_tokens_limit": 0,   # 0 = unlimited

    # Memory
    "memory_config": {"mode": "sliding_window", "value": 10000},
    "memory_results_limit": 5,
    "memory_similarity_threshold": 0.5,

    # Tools
    "enable_tools_knowledge_base": True,
    "tools_results_limit": 10,
    "tools_similarity_threshold": 0.1,

    # Memory Tool
    "memory_tool_backend": "local"   # or "s3", "db", None
}
Model Configuration
model_config = {
    "provider": "openai",   # or anthropic, groq, etc.
    "model": "gpt-4o",
    "temperature": 0.7,
    "max_tokens": 2000,
    "top_p": 0.95
}
MCP Tool Configuration
Basic stdio Configuration
mcp_tools = [
    {
        "name": "filesystem",
        "transport_type": "stdio",
        "command": "npx",
        "args": ["-y", "@modelcontextprotocol/server-filesystem", "/home"]
    }
]
HTTP Configuration with Bearer Token
mcp_tools = [
    {
        "name": "github",
        "transport_type": "streamable_http",
        "url": "http://localhost:8080/mcp",
        "headers": {
            "Authorization": "Bearer your-token"
        },
        "timeout": 60
    }
]
HTTP Configuration with OAuth
mcp_tools = [
    {
        "name": "oauth_server",
        "transport_type": "streamable_http",
        "auth": {
            "method": "oauth"
        },
        "url": "http://localhost:8000/mcp"
    }
]
SSE Configuration
mcp_tools = [
    {
        "name": "sse_server",
        "transport_type": "sse",
        "url": "http://localhost:3000/sse",
        "headers": {
            "Authorization": "Bearer token"
        },
        "timeout": 60,
        "sse_read_timeout": 120
    }
]
Server Configuration JSON Examples
Complete Configuration with Multiple Providers
{
  "AgentConfig": {
    "tool_call_timeout": 30,
    "max_steps": 15,
    "request_limit": 0,
    "total_tokens_limit": 0,
    "memory_results_limit": 5,
    "memory_similarity_threshold": 0.5,
    "enable_tools_knowledge_base": true,
    "tools_results_limit": 10,
    "tools_similarity_threshold": 0.1,
    "memory_tool_backend": "local"
  },
  "LLM": {
    "provider": "openai",
    "model": "gpt-4o",
    "temperature": 0.7,
    "max_tokens": 2000,
    "max_context_length": 30000,
    "top_p": 0.95
  },
  "Embedding": {
    "provider": "openai",
    "model": "text-embedding-3-small",
    "dimensions": 1536,
    "encoding_format": "float"
  },
  "mcpServers": {
    "filesystem": {
      "transport_type": "stdio",
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "/home"]
    },
    "github": {
      "transport_type": "streamable_http",
      "url": "http://localhost:8080/mcp",
      "headers": {
        "Authorization": "Bearer your-token"
      }
    }
  }
}
Anthropic Claude Configuration
{
  "LLM": {
    "provider": "anthropic",
    "model": "claude-3-5-sonnet-20241022",
    "temperature": 0.7,
    "max_tokens": 4000,
    "max_context_length": 200000,
    "top_p": 0.95
  }
}
Groq Configuration (Ultra-Fast)
{
  "LLM": {
    "provider": "groq",
    "model": "llama-3.1-8b-instant",
    "temperature": 0.5,
    "max_tokens": 2000,
    "max_context_length": 8000,
    "top_p": 0.9
  }
}
Azure OpenAI Configuration
{
  "LLM": {
    "provider": "azureopenai",
    "model": "gpt-4",
    "temperature": 0.7,
    "max_tokens": 2000,
    "max_context_length": 100000,
    "top_p": 0.95,
    "azure_endpoint": "https://your-resource.openai.azure.com",
    "azure_api_version": "2024-02-01",
    "azure_deployment": "your-deployment-name"
  }
}
Ollama Local Model Configuration
{
  "LLM": {
    "provider": "ollama",
    "model": "llama3.1:8b",
    "temperature": 0.5,
    "max_tokens": 5000,
    "max_context_length": 100000,
    "top_p": 0.7,
    "ollama_host": "http://localhost:11434"
  }
}
OpenRouter Configuration (200+ Models)
{
  "LLM": {
    "provider": "openrouter",
    "model": "anthropic/claude-3.5-sonnet",
    "temperature": 0.7,
    "max_tokens": 4000,
    "max_context_length": 200000,
    "top_p": 0.95
  }
}
Configuration Examples by Use Case
Local Development (stdio)
{
  "mcpServers": {
    "local-tools": {
      "transport_type": "stdio",
      "command": "uvx",
      "args": ["mcp-server-tools"]
    }
  }
}
Remote Server with Token
{
  "mcpServers": {
    "remote-api": {
      "transport_type": "streamable_http",
      "url": "http://api.example.com:8080/mcp",
      "headers": {
        "Authorization": "Bearer abc123token"
      }
    }
  }
}
Remote Server with OAuth
{
  "mcpServers": {
    "oauth-server": {
      "transport_type": "streamable_http",
      "auth": {
        "method": "oauth"
      },
      "url": "http://oauth-server.com:8080/mcp"
    }
  }
}
Testing
Running Tests
# Run all tests with verbose output
pytest tests/ -v
# Run specific test file
pytest tests/test_specific_file.py -v
# Run tests with coverage report
pytest tests/ --cov=src --cov-report=term-missing
Test Structure
tests/
├── unit/          # Unit tests for individual components
├── omni_agent/    # OmniAgent system tests
├── mcp_client/    # MCPOmni Connect system tests
└── integration/   # Integration tests for both systems
Development Quick Start
Installation
# Clone the repository
git clone https://github.com/omnirexflora-labs/omnicoreagent.git
cd omnicoreagent
# Create and activate virtual environment
uv venv
source .venv/bin/activate
# Install dependencies
uv sync --dev
Configuration
# Set up environment variables
echo "LLM_API_KEY=your_api_key_here" > .env
# Configure your servers in servers_config.json
Start Systems
# Try OmniAgent
python examples/cli/run_omni_agent.py
# Or try MCPOmni Connect
python examples/cli/run_mcp.py
Troubleshooting
Most Common Issues: Check the Quick Fixes below first!
Quick Fixes (Common Issues)
| Error | Quick Fix |
|---|---|
| Error: Invalid API key | Check your .env file: LLM_API_KEY=your_actual_key |
| ModuleNotFoundError: omnicoreagent | Run: uv add omnicoreagent or pip install omnicoreagent |
| Connection refused | Ensure the MCP server is running before connecting |
| ChromaDB not available | Install: pip install chromadb - see Vector DB Setup |
| Redis connection failed | Install Redis or use in-memory mode (default) |
| Tool execution failed | Check tool permissions and arguments |
| Vector database connection failed | Check ENABLE_VECTOR_DB and provider settings in .env |
| Embedding configuration required | Set EMBEDDING_API_KEY and configure embedding_config when using vector DB |
Detailed Issues and Solutions
1. Connection Issues
Error: Could not connect to MCP server
Solutions:
- Check if the server is running
- Verify server configuration in servers_config.json or the mcp_tools parameter
- Ensure network connectivity
- Check server logs for errors
- See Transport Types & Authentication for detailed setup
2. API Key Issues
Error: Invalid API key
Solutions:
- Verify the API key is correctly set in .env
- Check if the API key has the required permissions
- Ensure the API key is for the correct environment (production/development)
- For Azure OpenAI, verify azure_endpoint, azure_api_version, and azure_deployment
- See the Configuration Guide for correct setup
3. Redis Connection
Error: Could not connect to Redis
Solutions:
- Verify the Redis server is running: redis-cli ping
- Check Redis connection settings in .env: REDIS_URL=redis://localhost:6379/0
- Ensure the Redis password is correct (if configured)
- Use in-memory mode as a fallback: MemoryRouter("in_memory")
4. Tool Execution Failures
Error: Tool execution failed
Solutions:
- Check tool availability on connected servers: /tools command
- Verify tool permissions
- Review tool arguments for correctness
- Check tool timeout settings in agent_config
5. Vector Database Issues
Error: Vector database connection failed
Solutions:
- Ensure the chosen provider (Qdrant, ChromaDB, MongoDB) is running
- Check connection settings in .env
- Verify API keys for cloud providers
- Ensure ENABLE_VECTOR_DB=true is set
- Verify EMBEDDING_API_KEY is set when using the vector DB
- See Vector Database Setup for detailed configuration
6. Import Errors
ImportError: cannot import name 'OmniAgent'
Solutions:
- Check the package installation: pip show omnicoreagent
- Verify Python version compatibility (3.10+)
- Try reinstalling: pip uninstall omnicoreagent && pip install omnicoreagent
- Check that the virtual environment is activated
7. OAuth Server Behavior
Question: "Started callback server on http://localhost:3000" - is this normal?
Answer: Yes, this is completely normal when:
- You have "auth": {"method": "oauth"} in any server configuration
- The OAuth server handles authentication tokens automatically
- You cannot and should not try to change this address
If you don't want the OAuth server:
- Remove "auth": {"method": "oauth"} from all server configurations
- Use alternative authentication methods like Bearer tokens
8. Memory Tool Backend Issues
Error: Memory tool backend not available
Solutions:
- Verify memory_tool_backend is set correctly: "local", "s3", "db", or None
- For the local backend, ensure write permissions in the project directory
- Check that the ./memories/ directory can be created
Debug Mode
Enable debug mode for detailed logging:
# In MCPOmni Connect CLI
/debug
# In OmniAgent
agent = OmniAgent(..., debug=True)
Getting Help
- First: Check the Quick Fixes above
- Examples: Study working examples in the examples/ directory
- Issues: Search GitHub Issues for similar problems
- New Issue: Create a new issue with detailed information
Contributing
We welcome contributions! See CONTRIBUTING.md for guidelines.
Development Setup
# Fork and clone the repository
git clone https://github.com/omnirexflora-labs/omnicoreagent.git
cd omnicoreagent
# Set up development environment
uv venv
source .venv/bin/activate
uv sync --dev
# Install pre-commit hooks
pre-commit install
Contribution Areas
- OmniAgent System: Custom agents, local tools, background processing
- MCPOmni Connect: MCP client features, transport protocols, authentication
- Shared Infrastructure: Memory systems, vector databases, event handling
- Documentation: Examples, tutorials, API documentation
- Testing: Unit tests, integration tests, performance tests
Pull Request Process
- Create a feature branch from main
- Make your changes with tests
- Run the test suite: pytest tests/ -v
- Update documentation as needed
- Submit a pull request with a clear description
Code Standards
- Python 3.10+ compatibility
- Type hints for all public APIs
- Comprehensive docstrings
- Unit tests for new functionality
- Follow existing code style
License
MIT License - see the LICENSE file for details.
Author & Credits
Created by Abiola Adeshina
OmniCoreAgent is built by the OmniCoreAgent Team - the same team behind powerful AI agent frameworks and event-driven systems.
Related Projects:
- OmniCoreAgent - Production-ready AI agent framework with built-in MCP client, multi-tier memory, and workflow orchestration (this project)
- OmniDaemon - Universal event-driven runtime engine for AI agents
OmniCoreAgent and OmniDaemon are designed to work seamlessly together, providing a complete AI agent development ecosystem!
Connect with the creator:
- GitHub: @Abiorh001
- X (Twitter): @abiorhmangana
- Website: mintify.com
- Email: [email protected]
- Documentation: omnirexflora-labs.github.io/omnicoreagent
Acknowledgments
OmniCoreAgent is built on the shoulders of giants:
- LiteLLM - Unified LLM interface for 100+ models
- FastAPI - Modern Python web framework
- Redis - In-memory data store and message broker
- Qdrant - Vector database for semantic search
- ChromaDB - Embedding database
- Opik - Production observability and tracing
- Pydantic - Data validation
- APScheduler - Advanced Python scheduler
And all the amazing open-source projects that make OmniCoreAgent possible!
Why OmniCoreAgent?
What Sets It Apart
True Autonomy: Agents don't just respond; they plan multi-step workflows, use tools to gather information, validate results, and adapt their approach based on outcomes.
Composable Architecture: Build small, focused agents and compose them into sophisticated systems. A file system agent, data analysis agent, and reporting agent work together through routing, each handling what it does best.
Full Control: Create custom tools, define specialized routing logic, integrate any external service, and extend the framework to match your exact requirements. No black boxes.
Production-Ready: Not an experimental framework; it's built for real applications. Includes proper error handling, retry logic, session management, and observability. Deploy confidently knowing agents handle edge cases gracefully.
Framework Agnostic: Works seamlessly with FastAPI for web APIs, event-driven architectures, or any Python application. Build agents that respond to events, serve HTTP requests, or run as background services, all from the same core agent definition.
Cost Optimized: Smart context management and model switching reduce LLM costs. Use cheaper models for simple tasks, powerful models for complex reasoning, and maintain only necessary context in memory.
Clean Developer Experience: Abstract away LLM orchestration complexity while staying flexible. Define agent behavior, tools, and routing in clean Python code. The framework handles prompt management, tool calling, error handling, and model interactions.
Perfect For
- Enterprise AI Applications: Production-ready agents for business automation
- Intelligent Automation: Autonomous agents that handle complex workflows
- Customer Service Systems: AI-powered support with tool integration
- Data Analysis Workflows: Agents that analyze, process, and report on data
- Development Assistants: Code generation, testing, and DevOps automation
- Multi-Agent Systems: Complex orchestration with specialized agents
- Any scenario requiring AI agents that think, decide, and act autonomously
π MCP Client CLI (Backward Compatibility)
Note: The MCP Client CLI is available for backward compatibility and standalone MCP server management. For building AI agents, use OmniAgent (documented above). The MCP client functionality is also integrated into OmniAgent via the mcp_tools parameter.
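For example, a minimal sketch of that integration (everything except the mcp_tools parameter name is an assumption - consult the OmniAgent documentation for the exact signature):

# Hypothetical sketch only; argument names other than mcp_tools are assumed.
from omnicoreagent import OmniAgent  # import path assumed

agent = OmniAgent(
    name="assistant",
    system_instruction="You are a helpful assistant.",  # assumed argument name
    mcp_tools=[...],  # MCP server configs, e.g. the transport examples below
)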
π₯οΈ MCP Client CLI Commands
When using the standalone MCP client CLI (via python examples/cli/run_mcp.py), you have access to powerful interactive commands:
Memory Store Management
# Switch between memory backends
/memory_store:in_memory # Fast in-memory storage (default)
/memory_store:redis # Redis persistent storage
/memory_store:database # SQLite database storage
/memory_store:database:postgresql://user:pass@host/db # PostgreSQL
/memory_store:database:mysql://user:pass@host/db # MySQL
/memory_store:mongodb # MongoDB persistent storage
/memory_store:mongodb:your_mongodb_connection_string # MongoDB with custom URI
# Memory strategy configuration
/memory_mode:sliding_window:10 # Keep last 10 messages
/memory_mode:token_budget:5000 # Keep under 5000 tokens
Event Store Management
# Switch between event backends
/event_store:in_memory # Fast in-memory events (default)
/event_store:redis_stream # Redis Streams for persistence
Core MCP Operations
/tools # List all available tools
/prompts # List all available prompts
/resources # List all available resources
/prompt:<name> # Execute a specific prompt
/resource:<uri> # Read a specific resource
/subscribe:<uri> # Subscribe to resource updates
/query <your_question> # Ask questions using tools
Enhanced Commands
# Memory operations
/history # Show conversation history
/clear_history # Clear conversation history
/save_history <file> # Save history to file
/load_history <file> # Load history from file
# Server management
/add_servers:<config.json> # Add servers from config
/remove_server:<server_name> # Remove specific server
/refresh # Refresh server capabilities
# Agentic modes
/mode:auto # Switch to autonomous agentic mode
/mode:orchestrator # Switch to multi-server orchestration
/mode:chat # Switch to interactive chat mode
# Debugging and monitoring
/debug # Toggle debug mode
/api_stats # Show API usage statistics
π¦ Transport Types & Authentication
The MCP Client supports multiple transport protocols for connecting to MCP servers:
1. stdio - Direct Process Communication
Use when: Connecting to local MCP servers that run as separate processes
{
"server-name": {
"transport_type": "stdio",
"command": "uvx",
"args": ["mcp-server-package"]
}
}
- No authentication needed
- No OAuth server started
- Most common for local development
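For instance, a concrete stdio entry for the reference filesystem MCP server might look like this (the package name and directory are illustrative):

{
  "filesystem": {
    "transport_type": "stdio",
    "command": "npx",
    "args": ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"]
  }
}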
2. sse - Server-Sent Events
Use when: Connecting to HTTP-based MCP servers using Server-Sent Events
{
"server-name": {
"transport_type": "sse",
"url": "http://your-server.com:4010/sse",
"headers": {
"Authorization": "Bearer your-token"
},
"timeout": 60,
"sse_read_timeout": 120
}
}
- Uses Bearer token or custom headers
- No OAuth server started
3. streamable_http - HTTP with Optional OAuth
Use when: Connecting to HTTP-based MCP servers with or without OAuth
Without OAuth (Bearer Token):
{
"server-name": {
"transport_type": "streamable_http",
"url": "http://your-server.com:4010/mcp",
"headers": {
"Authorization": "Bearer your-token"
},
"timeout": 60
}
}
- Uses Bearer token or custom headers
- No OAuth server started
With OAuth:
{
"server-name": {
"transport_type": "streamable_http",
"auth": {
"method": "oauth"
},
"url": "http://your-server.com:4010/mcp"
}
}
- OAuth callback server automatically starts on http://localhost:3000
- This is hardcoded and cannot be changed
- Required for OAuth flow to work properly
π OAuth Server Behavior
Important: When using OAuth authentication, the MCP Client automatically starts an OAuth callback server.
What You'll See:
π₯οΈ Started callback server on http://localhost:3000
Key Points:
- This is normal behavior - not an error
- The address http://localhost:3000 is hardcoded and cannot be changed
- The server only starts when you have "auth": {"method": "oauth"} in your config
- The server stops when the application shuts down
- Only used for OAuth token handling - no other purpose
When OAuth is NOT Used:
- Remove the entire "auth" section from your server configuration
- Use "headers" with "Authorization": "Bearer token" instead
- No OAuth server will start
π¬ Prompt Management
The MCP Client provides advanced prompt handling with flexible argument parsing:
Basic Prompt Usage
# List all available prompts
/prompts
# Basic prompt usage
/prompt:weather/location=tokyo
# Prompt with multiple arguments
/prompt:travel-planner/from=london/to=paris/date=2024-03-25
JSON Format for Complex Arguments
# JSON format for complex arguments
/prompt:analyze-data/{
"dataset": "sales_2024",
"metrics": ["revenue", "growth"],
"filters": {
"region": "europe",
"period": "q1"
}
}
# Nested argument structures
/prompt:market-research/target=smartphones/criteria={
"price_range": {"min": 500, "max": 1000},
"features": ["5G", "wireless-charging"],
"markets": ["US", "EU", "Asia"]
}
Advanced Prompt Features
- Argument Validation: Automatic type checking and validation
- Default Values: Smart handling of optional arguments
- Context Awareness: Prompts can access previous conversation context
- Cross-Server Execution: Seamless execution across multiple MCP servers
- Error Handling: Graceful handling of invalid arguments with helpful messages
- Dynamic Help: Detailed usage information for each prompt
π― Operation Modes
The MCP Client supports three distinct operation modes for different use cases:
Chat Mode (Default)
Characteristics:
- Requires explicit approval for tool execution
- Interactive conversation style
- Step-by-step task execution
- Detailed explanations of actions
- Best for: Learning, debugging, controlled execution
Usage:
/mode:chat
Autonomous Mode
Characteristics:
- Independent task execution
- Self-guided decision making
- Automatic tool selection and chaining
- Progress updates and final results
- Complex task decomposition
- Error handling and recovery
- Best for: Production automation, batch processing
Usage:
/mode:auto
Orchestrator Mode
Characteristics:
- Advanced planning for complex multi-step tasks
- Strategic delegation across multiple MCP servers
- Intelligent agent coordination and communication
- Parallel task execution when possible
- Dynamic resource allocation
- Sophisticated workflow management
- Real-time progress monitoring across agents
- Adaptive task prioritization
- Best for: Complex multi-server workflows, enterprise automation
Usage:
/mode:orchestrator
π Token & Usage Management
The MCP Client provides advanced controls and visibility over your API usage and resource limits.
View API Usage Stats
Use the /api_stats command to see your current usage:
/api_stats
This displays:
- Total tokens used
- Total request tokens
- Total response tokens
- Number of requests made
Set Usage Limits
You can set limits to automatically stop execution when thresholds are reached:
- Total Request Limit: Set the maximum number of requests allowed in a session
- Total Token Usage Limit: Set the maximum number of tokens that can be used
- Tool Call Timeout: Set the maximum time (in seconds) a tool call can take before being terminated
- Max Steps: Set the maximum number of steps the agent can take before stopping
Configuration:
{
"AgentConfig": {
"tool_call_timeout": 30, // Tool call timeout in seconds
"max_steps": 15, // Max number of reasoning/tool steps
"request_limit": 0, // 0 = unlimited, set > 0 to enable limits
"total_tokens_limit": 0, // 0 = unlimited, set > 0 for hard cap on tokens
"memory_results_limit": 5, // Number of memory results to retrieve (1-100)
"memory_similarity_threshold": 0.5, // Similarity threshold for memory filtering (0.0-1.0)
"enable_tools_knowledge_base": false, // Enable semantic tool retrieval
"tools_results_limit": 10, // Max number of tools to retrieve
"tools_similarity_threshold": 0.1, // Similarity threshold for tool retrieval
"memory_tool_backend": "None" // Backend for memory tool: "None", "local", "s3", or "db"
}
}
Note: When any of these limits are reached, the agent will automatically stop running and notify you.
Created by Abiola Adeshina and the OmniCoreAgent Team
Building the future of production-ready AI agent frameworks
β Star us on GitHub | π Report Bug | π‘ Request Feature | π Documentation | π¬ Discussions