Knowledge OS Stage 1: wiki + ingest + MCP
Context
Aspire org knowledge sits scattered across 12+ surfaces: Hubdoc, Xero, GitLab, Notion, Slack history, Google Drive, individual brains. Asking "who's our backup produce supplier for Chanya?" or "where did we decide on Stripe vs Zinga Pay?" requires manual hunting. KO Stage 1 builds a queryable wiki backed by an Obsidian vault + Postgres index, fed by both human writes and an OCR ingest pipeline. iZinga is the platform brand; KO is its first product.
Detail
Goal
A markdown-based wiki where any Aspire agent or human can answer business questions in <5 seconds via either the Obsidian quick switcher or the MCP `knowledge_query` tool.
Architecture
Components
| Component | Status | Notes |
|---|---|---|
| Obsidian vault (aspire/knowledge-os-vault) | ✅ live | git-backed, 8 brand namespaces + _meta + _inbox + _templates |
| Obsidian Git plugin (auto-commit 5min, auto-pull 10min) | ✅ live | Verified 2026-05-17 |
| Templater (6 page-type templates) | ✅ live | Hotkey Cmd+Shift+T |
| Dataview dashboard (_meta/dashboard.md) | ✅ live | 7 live queries |
| wiki-api (FastAPI, port 8081) | ✅ built | Postgres index + REST endpoints |
| mcp-bridge (FastMCP, port 8082) | ✅ built | 6 tools: health, knowledge_query, get_page, ingest, update_page, lint_page |
| worker-ocr (async PDF → vault) | ✅ built | Vision via claude-haiku-4-5 (qwen-3.6-omni broken upstream); native PDF first |
| ko-knowledge MCP wired into Claude Code | ✅ live | Verified end-to-end at 3 layers 2026-05-16 |
| Production VPS deploy on OpenClaw | ⏸ pending | when: manual gate in .gitlab-ci.yml; blast radius 28 agents |
| Aspire LLM Gateway integration | ✅ live | Virtual key knowledge-os smoke-tested 2026-05-19 |
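
To make the "index + query" path concrete, here is a minimal sketch of a full-text lookup over vault pages. It is an illustration under stated assumptions, not the wiki-api implementation: SQLite FTS5 stands in for the Postgres index, and the `pages` table, its `path`/`body` columns, and the `build_index` helper are hypothetical. Only the name `knowledge_query` comes from the tool list above.

```python
import sqlite3


def build_index(pages):
    """Build an in-memory FTS5 index from (path, body) tuples.

    Assumption: SQLite FTS5 is a stand-in for the real Postgres index.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE VIRTUAL TABLE pages USING fts5(path, body)")
    conn.executemany("INSERT INTO pages (path, body) VALUES (?, ?)", pages)
    return conn


def knowledge_query(conn, question, limit=5):
    """Return page paths whose text contains every term in `question`."""
    terms = question.split()
    if not terms:
        return []
    # Quote each term so punctuation is not parsed as FTS5 query syntax.
    match = " ".join('"' + t.replace('"', '""') + '"' for t in terms)
    rows = conn.execute(
        "SELECT path FROM pages WHERE pages MATCH ? ORDER BY rank LIMIT ?",
        (match, limit),
    )
    return [row[0] for row in rows]
```

Space-separated terms are combined with an implicit AND, so a query only matches pages containing all of its terms; a production version would likely relax this or rank partial matches.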
Milestones
- [x] T0: Vault repo + Obsidian setup (2026-04-19)
- [x] T1–T6: Engine scaffold + wiki-api + mcp-bridge + worker-ocr (April–May 2026)
- [x] T7d: Ingest fallback chain (pdftotext → pdfplumber → PyMuPDF → AI vision) (2026-05-12)
- [x] T5: Vault round-trip (human + agent writes both reach `origin/main` with distinguishable authorship) (2026-05-16)
- [x] T5b: Obsidian.app UX layer + 3 plugins configured (2026-05-17)
- [x] Gateway migration: Engine uses Aspire LLM Gateway virtual key (2026-05-17)
- [ ] T0 stress test: 5+ hand-written Chanya supplier pages (in progress)
- [ ] Production deploy: Manual click in GitLab CI when smoke evidence is strong
- [ ] Real PDF batch ingest: process 10-20 real Aspire documents through worker-ocr
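
The T7d ingest fallback chain can be pictured roughly as follows. This is a sketch of the general pattern, not the worker-ocr code: extractors are passed in as plain callables so the example runs without any PDF library, and the `extract_with_fallback` name and the `min_chars` usability threshold are assumptions made for illustration.

```python
from typing import Callable, Iterable, Optional, Tuple

# An extractor takes raw PDF bytes and returns text, or None if it found nothing.
Extractor = Callable[[bytes], Optional[str]]


def extract_with_fallback(
    pdf_bytes: bytes,
    extractors: Iterable[Tuple[str, Extractor]],
    min_chars: int = 20,  # assumed threshold for "usable" text
) -> Tuple[str, str]:
    """Try each extractor in order and return (name, text) for the first usable result."""
    errors = []
    for name, extract in extractors:
        try:
            text = extract(pdf_bytes)
        except Exception as exc:
            # A failing extractor is recorded and the chain moves on.
            errors.append(f"{name}: {exc}")
            continue
        if text and len(text.strip()) >= min_chars:
            return name, text
        errors.append(f"{name}: too little text")
    raise RuntimeError("all extractors failed: " + "; ".join(errors))
```

In this arrangement the cheap text-layer extractors would be listed first and the AI vision call last, so the expensive step only runs for PDFs with no usable text layer.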
Stage 2 / Stage 3 (out of scope this phase)
- Stage 2 = LightRAG semantic search over the vault, triggered when query volume exceeds 1k/day OR pure full-text search (FTS) proves demonstrably insufficient
- Stage 3 = per-tenant isolated indexes for iZinga's commercial product
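
The Stage 2 trigger is an either/or condition, which could be checked along these lines. This is a sketch only: the function names are hypothetical, the 1,000/day threshold is taken from the bullet above, and measuring volume as the busiest single day (rather than an average) is an assumption.

```python
from collections import Counter
from datetime import date
from typing import Iterable


def peak_daily_queries(query_dates: Iterable[date]) -> int:
    """Return the query count of the busiest day (0 if there are no queries)."""
    counts = Counter(query_dates)
    return max(counts.values(), default=0)


def stage2_triggered(
    query_dates: Iterable[date],
    fts_insufficient: bool,
    threshold: int = 1000,
) -> bool:
    """True when volume exceeds the threshold OR FTS has been judged insufficient."""
    return peak_daily_queries(query_dates) > threshold or fts_insufficient
```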
Open questions
- Content ownership post-seeding: who maintains vault quality day-to-day?
- cloud-first.ai data retention policy: confirmed on-prem for operational data routing?
- Pilot brand for first real load: Chanya remains the default per original spec.
Provenance
Full architecture: aios-workspace/engineering/projects/izinga/ARCHITECTURE.md. Engine repo: gitlab.dssc.co.th/aspire/infrastructure/knowledge-os. Vault repo: gitlab.dssc.co.th/aspire/knowledge-os-vault.