volcengine/OpenViking
OpenViking is an open-source context database built specifically for AI agents. It replaces the flat vector storage model of traditional RAG with a filesystem paradigm, organizing agent context (memories, resources, and skills) into a hierarchical structure that mirrors how developers already think about projects. The result is faster, cheaper, and more accurate retrieval for long-running agent systems.

The core architecture implements three-tier context loading (L0/L1/L2): data loads on demand as agents need it, dramatically cutting token consumption compared to systems that dump everything into context at once. When an agent needs information, OpenViking navigates to the highest-scoring directory first, then drills recursively into its subdirectories, preserving the global context structure rather than returning isolated chunks. Every retrieval trajectory is visualized, so developers can see exactly how context was navigated when debugging.

Memory is self-evolving: conversations are automatically compressed and long-term memories are extracted over time. Multiple LLM providers are supported, including Volcengine Doubao, OpenAI, and any LiteLLM-compatible model.
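The tiered-loading and drill-down behavior described above can be sketched in a few lines. This is an illustrative sketch only, not OpenViking's actual API: the `Node` structure, `retrieve` function, and keyword-overlap `score` are all hypothetical stand-ins (a real system would use embedding similarity), but they show the key idea that high-scoring directories are explored recursively while low-scoring subtrees are pruned before their heavier L1/L2 data ever enters the context window.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    """A directory or file in a hypothetical hierarchical context store."""
    name: str
    summary: str                 # L0: cheap abstract, always available
    overview: str = ""           # L1: loaded when the node looks relevant
    content: str = ""            # L2: full payload, loaded only on demand
    children: list["Node"] = field(default_factory=list)

def score(query: str, text: str) -> float:
    # Toy relevance signal: keyword overlap. A real system would compare
    # embeddings instead of raw words.
    q, t = set(query.lower().split()), set(text.lower().split())
    return len(q & t) / (len(q) or 1)

def retrieve(root: Node, query: str, threshold: float = 0.3) -> list[Node]:
    """Descend into the highest-scoring directories first, recursing into
    subdirectories so results keep their place in the global hierarchy."""
    hits: list[Node] = []
    ranked = sorted(root.children,
                    key=lambda n: score(query, n.summary), reverse=True)
    for child in ranked:
        if score(query, child.summary) < threshold:
            continue  # prune: this subtree's L1/L2 data is never loaded
        if child.children:
            hits.extend(retrieve(child, query, threshold))
        else:
            hits.append(child)  # leaf: only now would L2 content be fetched
    return hits
```

The pruning step is where the token savings come from: an irrelevant subtree costs only its L0 summary, never its full contents.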
Why It Matters
Traditional RAG stores context in flat vector databases, causing context fragmentation and overflow on long-running agent tasks. OpenViking's filesystem paradigm dramatically reduces token usage through tiered loading while making retrieval both cheaper and more accurate. As AI agents become more autonomous, a purpose-built context database with observable retrieval chains will be foundational infrastructure for any serious agent deployment.