GraphRAG Concepts
GraphRAG (Knowledge Graph-based Retrieval-Augmented Generation) is an advanced RAG pattern that uses LLMs to extract a structured Knowledge Graph from unstructured text. This allows for reasoning across an entire corpus rather than just retrieving isolated snippets.
The Core Innovation: Beyond Similarity
Traditional RAG (Vector Search) is "near-sighted"; it finds similar text chunks but cannot "connect the dots" between pages that don't share semantic vectors. GraphRAG builds a Map of Knowledge where nodes are Entities and edges are Relationships.
Key Techniques for Agentic Wikis
1. Hierarchical Community Detection
Using algorithms like Leiden, GraphRAG clusters related notes into "communities."
- Application: This is the machine equivalent of a Map of Content (MOC). While humans build MOCs for navigation, an agent can use Community Summaries to understand the "Big Picture" of a vault without reading every file.
2. Global vs. Local Search
- Local Search: "Tell me about Entity X." (Explores immediate links).
- Global Search: "What are the major themes in this vault?" (Uses community summaries to synthesize a holistic answer).
3. Entity & Relationship Extraction
GraphRAG treats text as a source of Claims.
- The Seam: This is where the boundary between Flat Text and Structured Relational data dissolves. A YANP note is the source, but the extracted Graph is the "Active Memory."
Leverage for Humans-in-the-Loop
GraphRAG makes the agent's internal "world model" transparent:
- Inspectability: Humans can see the graph and correct a "broken link" or a "hallucinated relationship."
- Traceability: Every claim in the graph points back to a source
TextUnit(the original note).
Comparison: Wikilinks vs. GraphRAG
| Feature | Wikilinks (Human) | GraphRAG (Agent) |
|---|---|---|
| Creation | Manual / Intentional | Automatic / Extracted |
| Granularity | Page-to-Page | Entity-to-Entity |
| Discoverability | Navigational | Computational |
| Maintenance | High Effort | Algorithmic |