Beyond a
Chatbot Widget
An enterprise-grade multi-bot AI platform with unlimited workspaces, isolated data per bot, membership-aware access control, and 40+ AI models. No per-bot fees, no per-user charges, no per-message caps.
See It In Action
This is not a mockup. This is a production deployment serving thousands of members with tiered content access, real-time streaming, and cross-session memory.
How can I help you today?
Ask anything about your organization's knowledge base, policies, and documents.
Compare the tax benefits of Portugal vs Panama for US citizens
Based on 12 verified sources from your knowledge base, here is a detailed comparison:
What Makes This an Enterprise Solution
Every component is purpose-built for organizations that need unlimited bot workspaces, isolated data, and more than a simple Q&A widget — without per-bot or per-user pricing.
Membership Authentication & Tiered Access
Token-based authentication integrating with WordPress, custom CMS, or any membership platform. Per-user identity with group-based access control routes queries to the correct content collection. Content is physically separated into distinct vector database collections per tier.
Multi-Collection Knowledge Base
Content organized into separate vector database collections per membership tier. Each independently indexed, updated, and searchable. Supports articles, PDFs, video transcripts, podcasts, HTML, JSON, and CSV data.
Persistent Session Management
Every conversation tied to a unique session ID with full multi-turn context. Chat history stored in PostgreSQL, surviving server restarts. Sessions track message count, timestamps, user identity, access tier, detected intents, and IP addresses.
Cross-Session User Memory
Graph-based knowledge store (Graphiti) maintains long-term memory across sessions. Key facts stored as episodes in each user's memory graph. Subsequent visits retrieve relevant history for personalized responses. Memory is scoped per user per collection — zero cross-user leakage.
Async Job Queue with SSE Streaming
Chat requests queued as jobs in PostgreSQL, processed by a dedicated background worker. Real-time progress streamed via Redis pub/sub and Server-Sent Events. 10 granular progress stages from job creation to completion.
Intent Classification System
Hybrid classifier (regex + LLM fallback) analyzes each query before response generation. Detected intents trigger specialized system prompts optimized for tables, comparisons, cost breakdowns, how-to guides, recommendations, and resource location.
Deep Research vs Quick Mode
Two response modes give users control over depth vs speed. Quick Mode scans 15 sources in ~5 seconds. Deep Research scans 80 sources with 20 context documents and up to 16,000 tokens for comprehensive analysis.
AI-Powered Web Search
Augments knowledge base answers with live web data. LLM generates targeted queries, executed via Tavily API. Results compared against knowledge base context to detect outdated information. Sources scored on domain authority, freshness, and consensus.
Smart Multi-Entity Retrieval
Automatically detects comparison queries, decomposes them into sub-queries, retrieves separately from the vector store for each entity, then merges and deduplicates results for balanced coverage.
Source Citations & Rich Metadata
Every response includes title, URL, date, content type, and relevance score. Web sources include domain authority. Dates normalized to ISO format. Sources deduplicated across knowledge base and web. 6 contextual follow-up questions generated per response.
Admin Analytics Dashboard
Login-protected admin interface for monitoring chatbot usage. Session browser with filtering by user, date, and collection. Full conversation transcripts with metadata. User analytics including message counts, active sessions, and usage patterns.
Production Operations
Gunicorn with 4 Uvicorn async workers for concurrent requests. Separate systemd-managed worker process. Health monitoring endpoint, comprehensive logging, graceful shutdown, CORS whitelisting, and configurable LLM/embedding backends.
Two Modes, One Pipeline
Quick Mode
DefaultDeep Research
Power ModeProduction-Grade Infrastructure
Every component is battle-tested in production, serving real users with real membership tiers and real content archives.
Built With Best-in-Class Tools
Per-Bot Pricing vs Unlimited Platform
See why organizations choose our unlimited multi-bot platform over competitors that charge per bot, per user, and per message.
Deployed & Battle-Tested
Currently powering the AI knowledge assistant for a major financial publishing platform with thousands of paying members and multiple content tiers.
Multi-Tier Content
Free, Standard, Premium, and VIP content collections — each independently indexed and access-controlled per member subscription.
Content Types Indexed
Articles, newsletters, podcast transcripts, video transcripts, country profiles, CSV data, and HTML archives — all searchable via RAG.
Real-Time Streaming
10-stage progress updates streamed to the frontend via SSE. Users see exactly what the system is doing at every step.
Personalized Memory
Returning members get context-aware responses that reference their previous conversations and stated preferences.
Start Your Multi-Bot Deployment
Tell us about your organization, bot workspace requirements, and content volume. Our engineering team will prepare a custom architecture proposal.