I Built Self-Evolving Claude Code Memory w/ Karpathy's LLM Knowledge Bases

I Built Self-Evolving Claude Code Memory w/ Karpathy's LLM Knowledge Bases

AI Knowledge Bases and Personal Memory Systems

Introduction to AI Knowledge Bases

  • The speaker discusses the rapidly evolving landscape of AI, particularly focusing on large language models (LLMs) and their application in creating personal knowledge bases.
  • Reference is made to Andre Karpathy's insights on using LLMs for building personal knowledge systems that organize external information effectively.

Building a Personal Knowledge Base

  • The speaker emphasizes the importance of structuring information for agents to query efficiently, highlighting the use of Obsidian as a tool for managing this data.
  • A distinction is made between working with external data (as per Karpathy's model) and internal data, where the speaker has developed a memory system that evolves alongside codebases.

Infrastructure Overview

  • The architecture discussed includes indexing and optimizing how agents explore information, aiming for simplicity and effectiveness compared to existing solutions.
  • An analogy is drawn between handling knowledge and compiling code, illustrating how both processes involve transforming raw input into structured outputs.

Data Processing Steps

  • The process begins with collecting raw markdown documents in an Obsidian vault, which serves as the entry point for various types of content like articles and transcripts.
  • A "compiler" stage follows where an LLM processes this raw information to create summaries and link related documents together.

Finalizing Knowledge Structures

  • The output from the compiler becomes a "wiki," which organizes compiled articles with backlinks connecting different pieces of knowledge.
  • This interconnected structure allows agents to traverse through knowledge more effectively, enhancing search capabilities within the system.

Ensuring Data Integrity

  • A test suite performs checks akin to linting in coding; it identifies gaps or stale data in the knowledge base that need addressing.
  • Emphasis is placed on maintaining data integrity by ensuring all links are functional and relevant documents are included in the wiki.

Querying Information Effectively

  • Finally, queries are run against this structured wiki during runtime, allowing agents to retrieve pertinent information based on current tasks.
  • [] (No timestamp provided here but noted: Karpathy mentions not needing complex retrieval augmented generation (RAG), as LLM can maintain index files effectively.)

Understanding the Structure of an LLM Knowledge Base

The Role of the Index File

  • The index file is crucial as it outlines all folders and resources accessible to the agent, serving as a navigation starting point.
  • This approach simplifies data retrieval without needing complex semantic searches or vector databases, highlighting its effectiveness.

Data Flow and External Information

  • The Obsidian Web Clipper is introduced as a tool for easily importing internet content into the vault, specifically into a raw folder that serves as the source of truth.
  • Processed information is organized in a wiki format, with concepts extracted from raw documents and connections established between them.

Agent's Global Rules and System Understanding

  • The agents.mmd file contains global rules that inform the coding agent about its knowledge base structure, including sources of information and system logs.
  • This meta reasoning allows agents to understand their environment when starting new sessions with their second brain or coding agent.

Simplifying Implementation for Users

  • Users can build their own LLM knowledge-based systems by simply sending a specific prompt to their coding agent, streamlining setup processes.
  • A product requirement document (PRD), linked by Karpathy, outlines necessary components for integrating this system into personal projects.

Enhancing Memory with Session Logs

  • The architecture discussed allows for capturing session logs automatically, evolving memory alongside codebases rather than relying solely on external information.
  • This method enables coding agents to learn from past decisions and project evolution over time.

Introducing InsForge: A Comprehensive Development Platform

Features of InsForge

  • InsForge combines multiple functionalities like database management, authentication, storage solutions, and routing capabilities for large language models into one platform.

Quick Installation Process

  • Installing InsForge CLI takes less than five minutes; users can then prompt it to create applications seamlessly within any codebase.

Demonstrating Application Creation

  • A live demonstration shows how quickly an application can be created using AI models for backend and frontend development while managing databases effectively.

User Engagement with AI Capabilities

  • Users are encouraged to try out Cloud Code prompts in their current projects to enhance long-term memory capabilities within their coding agents.

This structured overview captures key insights from the transcript while providing timestamps for easy reference.

Setting Up an Obsidian Vault for Claude Code Integration

Creating Your Obsidian Vault

  • The speaker recommends starting with an Obsidian vault as a canvas to manage memories and create a wiki using Claude Code.
  • Installation of Obsidian is free and straightforward; users need to select a folder path for their codebase to create a new vault.
  • Users can customize the appearance of their vault by selecting themes, with "Obsidianite" being highlighted as a favorite.

Understanding the System Architecture

  • The system operates solely on Claude Code hooks, eliminating the need for additional installations or integrations.
  • A session start hook is defined in settings.json, which loads necessary files like agents.mmd and index.mmd for efficient querying.

Utilizing Knowledge Base Effectively

  • The knowledge base allows the system to provide answers based on pre-existing information rather than performing deep analysis on the codebase.
  • The agent references specific knowledge base articles when answering questions, showcasing its ability to pull relevant data efficiently.

Daily Logs and Summarization Process

  • Daily logs capture summaries of conversations with Claude Code, serving as raw data equivalent for future reference.
  • The system's efficiency is emphasized; it retrieves answers quickly without needing extensive searches through git logs or codebases.

Hooks for Context Management

  • Two key hooks (pre-compaction and session end) send messages from Claude Code to another language model for summarization before losing context.
  • Summaries include decisions made, lessons learned, and action items from each session, ensuring structured documentation.

Customization Options

  • Users can modify scripts related to flushing logs or compiling summaries, allowing customization of prompts sent to the Claude agent SDK.
  • This flexibility enhances user experience compared to standard memory systems in Claude Code.

Understanding the Compounding Loop in Knowledge Bases

The Self-Contained System

  • The system has comprehensive access to its agents and memory processes, allowing it to understand how everything functions, including prompts and daily logs.
  • It is described as a self-contained system capable of self-improvement over time.

Leveraging Knowledge for Answers

  • The process begins with asking questions that leverage the knowledge base, leading to synthesized answers from various sources like wiki articles.
  • As conversations progress, information is continuously saved, contributing to an expanding knowledge base that improves with each interaction.

Continuous Improvement of Responses

  • The agent's ability to search through accumulated knowledge enhances over time; more questions lead to better answers without requiring additional effort from users.
  • Automatic extraction of key takeaways occurs after sessions end or during memory compaction, ensuring valuable insights are logged efficiently.

Building Long-Term Memory

  • Insights gained from interactions will eventually be integrated into a personal wiki, enhancing the quality of responses provided by the agent over time.
  • Acknowledgment is given to Claude for inspiration regarding long-term memory systems and their implementation in personal knowledge management.

Community Engagement

  • Mention of a recent workshop conducted in the Dynamis community where practical applications were demonstrated for building a second brain.
Video description

Andrej Karpathy posted about using LLMs to build personal knowledge bases - raw articles go in, an LLM compiles them into an interconnected wiki, and health checks keep everything consistent. It went massively viral. But here's the thing: the most valuable raw data isn't external articles. It's your own conversations with your agents. Every time you work with Claude Code, you make decisions, discover gotchas, learn patterns, and build up context that vanishes when the session ends or the context window compacts. What if instead of losing all of that, every conversation automatically compiled into a structured knowledge base that gets smarter over time? That's what I built and in this video I'll show you how to use it! ~~~~~~~~~~~~~~~~~~~~~~~~~~ - Try InsForge - the open source platform that gives your coding agent a database, auth, storage, AI model routing, and hosting all in one. Free to get started, use code InsForgePromo for a free month of Pro: https://insforge.dev/ ~~~~~~~~~~~~~~~~~~~~~~~~~~ - If you're interested in building your own AI second brain to save yourself hours every week, check out the Dynamous community and the new 4 hour second brain bootcamp: https://dynamous.ai/second-brain-bootcamp - Karpathy inspired Claude Code memory system repo: https://github.com/coleam00/claude-memory-compiler - Karpathy's original Tweet: https://x.com/karpathy/thread/2039805659525644595 - Gist to build your own Karpathy LLM knowledge base: https://gist.github.com/karpathy/442a6bf555914893e9891c11519de94f ~~~~~~~~~~~~~~~~~~~~~~~~~~ 0:00 Karpathy LLM Knowledge Bases 2:32 How it Works 6:01 Data Flow and Architecture 8:40 My Claude Code Memory System 9:19 InsForge 10:43 Setting Up the System 11:04 Obsidian Setup and Vault Configuration 12:16 Claude Code Hooks 16:51 The Compounding Knowledge Loop 18:35 Outro ~~~~~~~~~~~~~~~~~~~~~~~~~~ Join me as I push the limits of what is possible with AI. I'll be uploading videos weekly - at least every Wednesday at 7:00 PM CDT!