KMM v0.0.2 Revolutionizes Knowledge Management with Modular, End-to-End Pipeline and Seamless Cloud Sync
June 21, 2026
The framework is designed as an end‑to‑end pipeline (collect → analyze → notes → knowledge graph → cloud sync) rather than a collection of isolated scripts, enabling cohesive knowledge flow.
Each tool category uses a unified TOOL_INVENTORY structure that records capability levels, deployment status, and applicable scenarios.
KMM is most valuable when agents forget prior information, when RAG results are noisy, or when ongoing information ingestion is needed, emphasizing a solid collection pipeline and knowledge routing before chasing complex search or storage systems.
The collection pipeline comprises three layers with multiple tools (e.g., 9Scrapling, Chrome DevTools Protocol, GStack Browser, yt-dlp, Whisper ASR, 9SenseNova, MinerU, book_cache) to ingest videos, audio, and documents.
Refinement capabilities include refine_pdf to convert PDFs into Hermes Skill and KMM notes, placing results in designated directories for skills and notes.
A core design is local-first augmented search (ANYSEARCH), which searches local notes first and automatically falls back to web‑centric search if results fall below a threshold to prevent hallucinating local knowledge.
KMM v0.0.2 completes the chain by decoupling collection and memory storage, focusing on three tasks: pull raw knowledge from over 40 tools, refine it into structured notes and graph nodes, and synchronize across devices by writing to OneDrive for cross‑device access.
The framework is modular and reusable; not all 40+ tools are required, but the hierarchical design—by medium, then refinement, then weighted search—provides a robust modeling pattern.
Practical design decisions include avoiding reimplementing cloud sync in Python, recognizing OCR’s importance for video content, and separating deduplication from collection to other maintenance processes.
An end‑to‑end ingestion workflow is described: a Douyin video link is processed in parallel paths (audio via yt‑dlp, transcription via Whisper, key frames via PaddleOCR/EasyOCR), summarized into notes, written into gbrain, and synced to the cloud.
A three‑tier recall system combines a Local FTS5 index for speed, Hindsight vector search for semantic similarity, and the gbrain knowledge graph for associative reasoning, ensuring reliable retrieval.
Cloud synchronization uses rclone to unify syncing across 12+ services (including OneDrive, Baidu Cloud, Dropbox, Mega) with bidirectional sync every four hours, avoiding bespoke cloud SDKs.
Summary based on 2 sources