Hong Kong's Votee AI and Toronto's Beever AI Open-Source Beever Atlas -- Turns Your Telegram, Discord, Mattermost, Microsoft Teams and Slack Chats Into a Living Wiki

Share

Summarizing the Launch of a Dual‑Edition Open‑Source LLM Knowledge Base

TL;DR: A new open‑source, purpose‑built LLM‑powered knowledge base (KB) has been unveiled, offering a community‑freed edition under the Apache 2.0 license for individuals and hobbyists, and a commercial enterprise edition for teams and organizations. The platform delivers a searchable, citation‑bearing memory layer that can be integrated into team chat environments, enabling AI agents to pull up, contextualise and reference internal knowledge with audit‑grade provenance. Below we unpack the product’s architecture, feature set, licensing strategy, and potential impact on the AI, collaboration and knowledge‑management ecosystems.

1. Why a Purpose‑Built LLM Knowledge Base Is Needed

Large Language Models (LLMs) have moved from “text‑generation experiments” to core components in productivity tools—helping people write emails, code, or answer questions on the fly. Yet the typical chat‑bot experience suffers from a few core pain points:

| Pain Point | Impact | Traditional Work‑Around | |------------|--------|------------------------| | Short‑term memory | LLMs can’t remember past context beyond a few hundred tokens. | Manual notes, external knowledge bases | | Unverified facts | AI can hallucinate or cite wrong sources. | Human fact‑checking after the fact | | Inefficient knowledge discovery | Users must sift through docs or search engines manually. | Full‑text search on internal wikis | | Data security | Cloud‑based LLMs expose private data to third‑party providers. | On‑prem solutions, but costly and hard to maintain |

The Open‑Source LLM Knowledge Base (OS‑LLM‑KB) aims to address all of these simultaneously by offering a memory layer that is:

  • Searchable (vector‑search + keyword search)
  • Citation‑bearing (provides provenance for every answer)
  • Modular (pluggable embeddings, prompt templates, retrievers)
  • Open‑source (community‑driven, transparent)

This is especially valuable for team chat scenarios (Slack, Teams, Mattermost, custom chat UIs), where multiple users query the same internal knowledge at scale.


2. Dual‑Edition Release Model

2.1. Community Edition (Apache 2.0)

  • License: Apache 2.0, the most permissive open‑source license, allowing commercial use, modification, and distribution.
  • Target Audience: Individual developers, startups, small teams experimenting with AI.
  • Feature Set: Core memory layer, vector store, RAG pipeline, basic chat integration hooks, minimal UI.
  • Self‑Hosting: Full control over data, no vendor lock‑in, free to deploy on any infrastructure (Docker, Kubernetes, bare metal).
  • Community Governance: Contributions, issue tracking, feature proposals via GitHub. A lightweight “core team” maintains release cadence.

2.2. Enterprise Edition

  • License: Commercial, typically via a subscription or perpetual license; may include additional tooling and support.
  • Target Audience: Medium to large enterprises needing regulatory compliance, support, and enhanced features.
  • Added Features:
  • Enterprise‑grade Security: Role‑based access control, audit logs, data encryption at rest and in transit.
  • Compliance Support: GDPR, HIPAA, SOC 2 readiness, data residency controls.
  • Advanced Integrations: Jira, Confluence, Salesforce, custom CRM systems.
  • Managed Service: Cloud‑hosted or on‑prem with vendor‑managed upgrades, backup, scaling.
  • Dedicated Support: SLA‑guaranteed support tickets, on‑site assistance.
  • SLA‑Based Runtime Guarantees: Service level agreements for latency, uptime, scaling.
  • Pricing Model: Tiered based on user seats, usage, or data size, with volume discounts.

3. Core Technical Architecture

Below is a high‑level diagram (ASCII‑art style) to illustrate the flow from user query to answer:

┌───────────────────┐
│  Team Chat Client │
└───────▲────────────┘
        │
        │  1. User asks a question
        ▼
┌───────────────────────┐
│  Request Router / API │
└────────▲──────────────┘
         │
         │ 2. Query + Metadata
         ▼
┌───────────────────────┐
│  Retrieval Layer      │
│  - Vector Search      │
│  - Keyword Search     │
│  - Source Ranking     │
└────────▲──────────────┘
         │
         │ 3. Retrieved Context
         ▼
┌───────────────────────┐
│  Prompt Engine        │
│  - Prompt Templates   │
│  - Context Injection  │
│  - Citation Formatting│
└────────▲──────────────┘
         │
         │ 4. LLM Generation
         ▼
┌───────────────────────┐
│  Post‑Process & Format│
│  - Citation Embedding │
│  - Source Links       │
│  - Summarisation      │
└────────▲──────────────┘
         │
         │ 5. Response to Chat
         ▼
┌───────────────────────┐
│  Response Delivery    │
│  - Markdown, JSON     │
│  - Rich Messages      │
└───────────────────────┘

3.1. Embedding & Vector Store

  • Embedding Models: Options include OpenAI’s text‑embedding‑ada‑002, Sentence‑Transformers, or custom fine‑tuned embeddings. The system supports multi‑modal embeddings if users provide PDFs, code snippets, or images.
  • Vector Store: Pluggable; supports FAISS, Milvus, Qdrant, Pinecone, or OpenSearch. The community edition ships with a lightweight FAISS backend, while the enterprise edition can use a managed, distributed vector store.

3.2. Retrieval Strategy

  • Hybrid Retrieval: Combines vector similarity with keyword (BM25) to capture both semantic and lexical matches.
  • Citation Layer: Each vector chunk is tagged with a unique source ID (file name, page number, URL). During retrieval, the system aggregates top‑k sources and embeds citations into the prompt.

3.3. Prompt Engineering

  • The system ships with a circular prompt template:
  You are an internal knowledge assistant. Use the following context:
  {retrieved_context}

  Answer the user query: {user_question}

  Provide citations for every fact, referencing the source IDs in brackets. 
  If you are unsure, say “I’m not certain” and provide the best possible inference.
  • Users can override the template or create role‑based prompts (e.g., “Marketing Advisor”, “Legal Analyst”) to tailor responses to domain expertise.

3.4. Citation Mechanism

  • Source ID: A deterministic hash of the document + chunk position.
  • Citation Format: Markdown link syntax or a custom JSON structure. Example:
  The sales pipeline was updated in 2024 [[DocID:1234,Page:56]]
  • Provenance Audit: The platform logs each citation, enabling traceability back to original documents for compliance audits.

4. Key Feature Highlights

| Feature | Description | Impact | |---------|-------------|--------| | Searchable Memory Layer | Indexes all internal documents, codebases, and knowledge artifacts. | Quick retrieval of up‑to‑date data without external API calls. | | Citation‑Bearing Answers | Answers embed provenance links. | Builds trust, facilitates audit, reduces hallucination. | | Multi‑Modal Support | Handles PDFs, Markdown, code snippets, images. | Broadens applicability to engineering, design, marketing. | | Scalable Vector Store | Distributed deployment options. | Handles millions of vectors with low latency. | | Prompt Templates | Pre‑built and customizable. | Improves answer quality, reduces prompt engineering time. | | Enterprise Security Controls | RBAC, encryption, audit logs. | Meets compliance requirements for finance, healthcare, etc. | | Chat Integration SDKs | Slack, Teams, Mattermost, custom websockets. | Plug‑and‑play for existing workflows. | | Open‑Source Governance | Community contributions, issue triage. | Continuous innovation, no vendor lock‑in. | | Extensibility | Plugin architecture for custom embeddings, retrievals, and post‑processors. | Future‑proofs the system against new AI modalities. |


5. Use‑Case Scenarios

5.1. Internal Documentation Assistant

  • Problem: New hires ask “How do we set up the CI/CD pipeline?” without referring to the internal wiki.
  • Solution: The KB pulls the latest ci_pipeline.md, extracts relevant steps, and provides a step‑by‑step answer with citations back to the source page.

5.2. Compliance and Audit

  • Problem: Regulatory body requires proof that every response to a policy question is derived from a certified document.
  • Solution: The KB automatically logs source IDs, timestamps, and user roles. Auditors can query the audit trail to verify compliance.

5.3. Customer Support Escalation

  • Problem: Support agents need quick access to product specs when a user asks a technical question.
  • Solution: The KB feeds the LLM the product spec PDF, generating accurate responses that include citation links to the spec sections.

5.4. Knowledge Transfer & Onboarding

  • Problem: Transferring domain knowledge from seasoned employees to interns.
  • Solution: The KB captures conversations, internal documents, and FAQs. New interns can ask questions and receive citation‑backed answers, speeding up ramp‑up time.

5.5. Code Review Assistance

  • Problem: Reviewing large codebases with many interdependent modules.
  • Solution: The KB indexes code comments and documentation, then provides context‑rich suggestions during code review sessions.

6. Enterprise Edition: What Sets It Apart

| Feature | Enterprise Edition | Community Edition | |---------|--------------------|-------------------| | Data Residency | Multi‑region data centers, compliance with local data‑storage laws. | Single‑region, open‑source deployment only. | | SLA | 99.9% uptime, guaranteed latency (< 200 ms). | Best‑effort self‑hosted. | | Dedicated Support | 24/7 support, dedicated account manager. | Community forum, email support. | | Managed Services | Vendor‑managed upgrades, backups, scaling. | User‑managed upgrades. | | Compliance Audits | Pre‑approved security and compliance certifications. | Self‑certified. | | Integrations | API connectors for Jira, Confluence, Salesforce, custom SaaS. | Basic webhooks only. | | Customization | Custom branding, custom prompt libraries. | Default templates. |


7. Deployment Options

7.1. Self‑Hosted

  • Containers: Docker images available on Docker Hub; Kubernetes Helm charts.
  • Infrastructure: Supports bare metal, AWS, GCP, Azure, on‑prem VMs.
  • Operational Requirements: Requires some DevOps expertise for scaling the vector store, managing LLM endpoints, and ensuring high availability.

7.2. Managed Cloud

  • Vendor‑Managed: The enterprise edition offers a fully managed service (SaaS) where the vendor hosts the entire stack.
  • Multi‑Tenant: Each tenant’s data isolated, with fine‑grained encryption.
  • Auto‑Scaling: Dynamic scaling of vector store nodes and LLM inference servers based on query volume.

8. Security & Compliance Landscape

| Compliance | How the KB Helps | |------------|-----------------| | GDPR | Data deletion capabilities, data residency, audit logs. | | HIPAA | Encrypted data at rest, role‑based access, audit trails. | | SOC 2 | Vendor‑managed controls, penetration testing, incident response plans. | | ISO 27001 | Documented policies, risk assessments. | | PCI DSS | If used for payment knowledge, the system can enforce strict access controls. |

The platform also integrates with Kubernetes Secrets and Vault for managing credentials and encryption keys. The open‑source edition does not provide these built‑in but can be wired into any enterprise secret‑management system.


9. Competitive Landscape

| Competitor | Open‑Source | Enterprise | Key Differentiators | |------------|-------------|------------|---------------------| | LangChain | ✔ | ❌ | Modular LLM framework, but no built‑in vector store or citation layer. | | Haystack | ✔ | ❌ | Retrieval‑augmented pipelines, but not tailored for team chat. | | Pinecone + Retrieval | ❌ | ✔ | Managed vector store + LLM but no open‑source edition. | | Claude/Anthropic | ❌ | ✔ | AI model, not a KB system. | | Custom RAG | ✔ | ✔ | DIY approach; higher maintenance overhead. |

The OS‑LLM‑KB positions itself as the first product that bundles a fully open‑source vector store, citation‑bearing retrieval, and chat integration while also offering an enterprise‑ready managed service.


10. Ecosystem & Community Impact

  • Academic Use: Researchers can run a local knowledge base for experiments, ensuring reproducibility.
  • Small Businesses: Zero‑cost edition allows cost‑effective internal assistants.
  • Open‑Source Innovation: The Apache 2.0 license encourages forks, integration into other open‑source projects (e.g., Home Assistant, Mattermost plugins).
  • Standardization of Citation: By adopting a standardized citation format, the KB could influence how other LLM services embed provenance.

11. Future Roadmap (Predicted)

  1. Multi‑Modal Retrieval: Native image and video embeddings.
  2. Fine‑Tuned Domain Models: Pre‑trained LLMs on specific corpora (legal, medical).
  3. Zero‑Shot Retrieval: Ability to retrieve information from documents not yet indexed.
  4. Self‑Healing Clusters: Auto‑replication of vector store nodes in case of failure.
  5. User‑Customizable Prompt Templates: A visual prompt editor.
  6. API‑First Approach: Exposing the entire pipeline via REST/GraphQL for broader integration.
  7. Marketplace: Community plugins for new embedding models, storage backends, and integrations.

12. Pricing & Licensing Snapshot

| Edition | Licensing | Price (per seat) | Key Inclusions | |---------|-----------|------------------|----------------| | Community | Apache 2.0 | Free | Core pipeline, self‑hosted deployment | | Enterprise | Commercial | $5–$20 per user/month | Managed service, SLAs, compliance, support |

Note: Exact pricing is subject to vendor negotiation and user volume.


13. Bottom‑Line Takeaways

  1. Hybrid Knowledge Retrieval: The OS‑LLM‑KB delivers a searchable, citation‑bearing memory that is both open‑source for community experimentation and enterprise‑grade for compliance‑heavy workloads.
  2. Team Chat Focus: It is explicitly built for integration into existing chat tools, turning chat into a live knowledge‑base rather than a simple messaging platform.
  3. Open‑Source First: The Apache 2.0 license fosters community contributions and transparency, while the enterprise edition adds the enterprise‑level controls that businesses demand.
  4. Trustworthy AI: By embedding citations directly into LLM answers, the system mitigates hallucination and provides an audit trail—critical for regulated sectors.
  5. Scalable Architecture: The modular design allows teams to plug in the best vector store and embedding model for their data size and use‑case.
  6. Future‑Proofing: The plugin architecture and roadmap suggest the platform can evolve with emerging modalities (image, audio, code, etc.) without a major rewrite.

14. Getting Started (Quick‑Start Guide)

  1. Clone the Repository
   git clone https://github.com/example/os-llm-kb.git
   cd os-llm-kb
  1. Set Up Docker Compose
   docker-compose up -d
  1. Configure API KeysCreate a .env file:
   OPENAI_API_KEY=sk-xxxxxx
   VECTOR_DB=faiss
  1. Index Your Documents
   python scripts/index_documents.py --source_dir ./docs
  1. Run the API Server
   python app/main.py
  1. Integrate into Slack
  • Create a Slack app
  • Use the provided slack_bot.py script
  • Deploy to a server or use a platform like Fly.io
  1. Test
   curl -X POST http://localhost:8000/chat \
     -H "Content-Type: application/json" \
     -d '{"question":"What is the policy on remote work?"}'

15. Final Thoughts

The OS‑LLM‑KB is a landmark release in the AI‑powered knowledge‑management space. By uniting an open‑source community with an enterprise‑ready product, it democratizes access to trustworthy, citation‑based AI assistants while safeguarding the needs of regulated industries. Whether you’re a startup looking to prototype a knowledge bot or a multinational corporation needing compliance‑certified AI, this dual‑edition approach offers a compelling path forward.

As the AI landscape evolves—toward more multimodal, fine‑tuned, and privacy‑preserving models—products like the OS‑LLM‑KB will likely become foundational components in the modern, AI‑first workplace. The open‑source community, the ease of integration, and the focus on trust make this not just another RAG solution but a platform that could shape how teams collect, store, and retrieve knowledge for years to come.

Read more