The GBrain Validation: Y Combinator's CEO Runs His Agents on Markdown
In April 2026, Garry Tan (President and CEO of Y Combinator) open-sourced GBrain, the system he built to run his actual AI agents. The numbers are staggering: 17,888 pages, 4,383 people profiles, 723 companies, 21 autonomous cron jobs, built in 12 days, starting from 10,000+ markdown files and 3,000 people pages.
GBrain stores knowledge as markdown files in a Git repository, backed by PostgreSQL and pgvector for hybrid search. But the architecture is deliberately simple at its core: "Compiled truth on top, append-only timeline below."
Every page follows this pattern: a living summary section that gets rewritten as evidence changes, followed by an immutable timeline preserving the proof trail. Tan specifically notes that "GBrain thanks to being git+postgres works wonderfully with multiple agents simultaneously," describing something vector-only systems struggle with: multiple agents reading and writing to the same knowledge base without corruption or drift.
The system runs a nightly "dream cycle" that enriches entity pages, consolidates memory, fixes citations, and wires the knowledge graph — all while the user sleeps. The knowledge graph isn't built with expensive LLM calls. It's extracted automatically: every page write parses entity references and creates typed links (attended, works_at, invested_in, founded, advises) with zero LLM calls.
On the BrainBench benchmark, GBrain achieves P@5 49.1% and R@5 97.9%, beating its own graph-disabled variant by +31.4 points — proving that the graph layer and structured timeline aren't nice-to-haves, they're the difference between retrieval and understanding.