Why are traditional databases not ideal for AI agent memory?

Traditional databases like relational or NoSQL also require significant training for LLMs to query effectively. While vector databases are good for similarity search, they are often narrowly useful compared to the universal understanding LLMs have of file system semantics.

How does Unix philosophy relate to AI agent development?

The Unix philosophy of building complex systems from simple, atomic building blocks, where everything is represented as a file descriptor, is highly compatible with LLMs. This principle allows for powerful agent capabilities by leveraging LLMs' inherent understanding of file system operations.

What is the role of markdown files in agent skills?

Markdown files, like those used in Claude.md and for agent skills, are powerful because LLMs natively understand their structure. This allows developers to provide agents with information, behaviors, and skills without complex training.

How can cloud file systems like Box enhance AI agents?

Cloud file systems offer durability, multi-user support, versioning, and built-in governance. They provide an inspectable audit trail of agent actions and can be mounted as local file systems, allowing agents to operate seamlessly.

What are the benefits of using file system semantics for AI agents?

LLMs are already pre-trained on file system semantics, making them highly productive tools. This approach also creates a familiar interface for both humans and agents, reducing the need for custom development and training.

What is the 'winning engineering principle' for building AI agents?

The winning principle is to 'ride the model's priors' by leveraging existing paradigms and interfaces that the LLM is already trained on, such as file systems, rather than trying to retrain agents or build bespoke solutions.

Why is it important for AI agent memory to be inspectable?

LLMs are non-deterministic. If their memory is a black box, it becomes very difficult for operators to understand, debug, and introspect the agent's behavior and decision-making process.

Key Moments

AI Dev 26 x SF | Carter Rabasa: File Systems Are the New Primitive for AI Agents

DeepLearning.AI

Education6 min read33 min video

May 21, 2026|36 views|2|1

Save to Pod

Want to know something specific about what's covered?

We've already dissected every moment. Ask and we will deliver (with timestamps).

Key Moments

TL;DR

LLMs are already masters of file systems due to decades of training, yet developers keep building custom APIs for AI agents instead of leveraging this innate capability, making development unnecessarily complex.

Key Insights

90% of enterprise data is unstructured, yet traditional software development often focuses on the remaining 10% residing in relational or NoSQL databases.

LLMs possess extensive training on Unix/Linux file system commands and semantics, which can be leveraged for agent development, eliminating the need for bespoke APIs.

Agent skills, like Claude.md and Langchain's approach, are increasingly standardized around file system structures (directories, skill.md files) because LLMs natively understand them.

OpenAI's Agent SDK now supports remote file system mounts, allowing agents to interact with cloud storage like Box as if it were a local file system.

Leveraging file systems for agent-human interaction simplifies development by providing a shared, intuitive interface, reducing the need to build custom web or Slackbot interfaces.

The Unix philosophy of building complex systems from simple, atomic building blocks (like file descriptors representing various resources) is highly relevant for modern AI agent design.

The overlooked potential of file systems for AI agents

The prevailing approach to AI agent development has often overlooked the inherent capabilities of Large Language Models (LLMs) regarding file systems. Despite LLMs being trained on vast amounts of code, operating systems, and file-based workflows, developers frequently resort to building custom APIs and complex data schemas. This is particularly surprising given that 90% of enterprise data is unstructured (PDFs, Word docs, etc.), a domain where file systems naturally play a crucial role. The traditional focus on relational and NoSQL databases, which constitute only about 10% of how work gets done, has led to a significant underutilization of LLMs' pre-existing expertise in manipulating file structures. This talk argues that file systems should be considered the new primitive for building AI agents, as they provide a powerful, intuitive, and readily understood foundation for memory, state, and agent capabilities.

Why file systems are a natural fit for LLMs

LLMs have been inadvertently trained on decades of file system semantics. Every command-line interface, every GitHub repository, every Linux man page has contributed to an LLM's deep understanding of how directories are structured, how files are organized, and how to perform operations like `ls`, `cat`, `grep`, and `write`. This ingrained knowledge means that introducing file system operations to an agent requires little to no retraining. Developers can leverage this 'prior,' meaning the model's existing knowledge, rather than spending effort teaching it bespoke systems. The Unix philosophy, which emphasized building complex systems from simple, atomic building blocks represented by file descriptors, is directly applicable here. This principle allows for radical complexity to be engineered on top of simple, uniform interfaces, which is precisely what file systems offer to LLMs.

File system semantics enable effective long-term memory

A critical challenge in agent development is providing long-term memory. Effective memory solutions must retain facts across sessions, retrieve information selectively, allow for updates and corrections, and be inspectable by operators. While databases like relational, NoSQL, or vector databases have their strengths, they often fall short on these criteria, especially human inspectability. File systems, however, naturally support these requirements. Agents can read and write files to store state, organize information in hierarchical directories, and update content. The non-deterministic nature of LLMs makes inspectability crucial, and file systems provide a human-readable layer for this. Moreover, cloud-based file systems, like Box, add benefits such as durability, versioning, and robust audit trails, making them ideal for enterprise-grade agent memory.

Emerging agent frameworks embrace file system paradigms

This shift towards file systems as a core primitive is already evident in popular agent development frameworks. For instance, Claude.md initially used simple markdown files to provide LLMs with skills and behavioral instructions, which has since evolved into the standardized concept of 'agent skills'—essentially directories with a `skill.md` file. Langchain's Deep Agents SDK utilizes file system semantics for its pluggable backend, supporting commands like `write file`, `edit file`, and `glob`. Similarly, OpenAI's Agent SDK now supports remote file system mounts, allowing agents to treat cloud storage, such as a Box drive, as a local workspace. These advancements demonstrate a recognition that LLMs are already proficient with file system operations, and developers should build upon this existing understanding rather than inventing new, bespoke interfaces.

Demonstrating file system-based agent interaction

A practical demonstration showcased how a simple Python application, with an agent defined in less than 20 lines of code, leverage Box files for team management. Markdown files (`team.md`, `tasks.md`) described team members and tasks. By marking a task as complete in a human-readable UI (Box), the agent, running via the OpenAI Agent SDK mounted to Box, detected the change. The agent then synchronized this update across both files using basic file system operations, without needing custom APIs or direct CLI access. This illustrates the power of human-in-the-loop systems where both humans and agents can interact with a shared file system, simplifying complex workflows and enhancing collaboration. The demonstration highlighted the agent's ability to introspect and synchronize data across denormalized files, embodying proactive task management through file manipulation.

Beyond local files: Unix's broader influence

The concept extends beyond traditional file storage. The Unix philosophy treated everything as a file descriptor, including running processes (`/proc`) and device I/O. This principle is highly relevant for agents. For example, accessing CPU information on a cloud VM can be done by reading from `/proc/cpuinfo`, treating system processes as files accessed via file system semantics. While not advocating for the complete abandonment of databases or APIs, the talk suggests building 'shims'—interfaces that translate agent needs into file system operations. An example is the Chroma team's collaboration with Mitlfi to create a `ChromaFS` shim, enabling access to Chroma data through file system semantics. This highlights how existing data stores can be integrated with agents by presenting their data in a file-like manner.

File systems as bridges for human and agent interaction

Leveraging file systems is advantageous not only for LLMs but also for human users involved in an agent's workflow. Instead of building bespoke web interfaces or Slack bots that require user training and significant development effort, developers can utilize shared file systems. This provides a common, intuitive interface that both humans and agents understand, drastically simplifying the development of collaborative agentic applications. As agent capabilities expand into broader computer use—like operating a Mac—understanding native file system operations becomes paramount. Cloud file systems further enhance this by offering durability, multi-user support, versioning, and enterprise-grade governance and permissions, making them a robust choice for many agent applications.

Riding the model's priors: the winning engineering principle

The core engineering principle for modern AI development is to 'ride the model's priors'—leverage what LLMs already know. Instead of spending time and resources retraining agents or developing custom solutions for every problem, engineers should seek existing paradigms and interfaces that LLMs are inherently trained on. File systems represent a massive, inherent 'prior' for LLMs. The talk concludes by reiterating the profound simplicity and scalability of Unix's initial idea: representing everything as a file. This foundational concept has powered some of the most complex systems ever built, and by neglecting it for bespoke, complex solutions, developers are missing out on a powerful, native capability of LLMs. The ongoing challenge is to remember and apply this elegant principle in building future AI agents, promoting productivity and innovation.

Mentioned in This Episode

●Products

●Software & Apps

●Companies

●Organizations

●Concepts

●People Referenced

Common Questions

The core argument is that file systems, due to their long history and the vast amount of training data LLMs have on them, serve as a highly effective and native primitive for AI agents, enabling better long-term memory and interaction.

Topics

Ai Agents AI & Machine Learning Technology & Innovation Developer Tools LLM Memory File Systems Unix Philosophy Agent Skills Cloud Storage

Mentioned in this video

Software & Apps

Linux

An operating system whose file system commands are fundamentally understood by all LLMs, making it a superior primitive for AI agents.

OpenAI agents SDK

An SDK that allows defining workspaces for sandbox agents and supports remote file system mounts, including Box.

PostgreSQL

A relational database system mentioned as a potential, but ultimately less suitable, solution for long-term agent memory.

ChatGPT

A well-known LLM that users have likely experimented with, often facing the problem of agents forgetting information.

Claude Code

An LLM mentioned in the context of agents needing long-term memory and its early use of markdown files for skills.

LangChain

A framework with a deep agents SDK that leverages file system semantics for its pluggable backend system, enabling powerful agents.

MySQL

A relational database system mentioned as a potential, but ultimately less suitable, solution for long-term agent memory.

Chroma

A system that the Chroma team worked with Mitlfi to build a shim for, allowing access to Chroma data using file system semantics.

Ubuntu

An operating system used on a Digital Ocean droplet to demonstrate interacting with the /proc directory via file system semantics.

Codex

A system that shipped an update with built-in computer use, capable of manipulating files on a Mac but less so custom applications.

Docker

Software required to be installed to get the Box Sandbox CLI demo up and running.

Vector Databases

Databases discussed as a potential solution for agent memory, noted as fantastic for similarity search but narrowly useful.

Python

The programming language used to build the demonstration application for the Box Sandbox CLI.

Companies

Oracle

A relational database system mentioned as a potential, but ultimately less suitable, solution for long-term agent memory.

GitHub

A platform whose repositories LLMs have been trained on, informing their native understanding of directory structures and files.

Digital Ocean

A cloud platform provider where a Ubuntu droplet was used to demonstrate file system interactions with the /proc directory.

Twilio

A company where Carter Rabasa worked, contributing to his experience in developer tooling.

Concepts

Unix

An operating system whose philosophy and file system commands are fundamentally understood by all LLMs, making it a superior primitive for AI agents.

People

Dennis Ritchie

Co-inventor of Unix, whose work laid the foundation for powerful and simple systems built on atomic building blocks.

Ken Thompson

Co-inventor of Unix, whose work laid the foundation for powerful and simple systems built on atomic building blocks.

Carter Rabasa

The speaker, who leads developer relations at Box and has over 26 years of software building experience, specializing in developer tools.

Organizations

Box

A company that historically helps enterprises manage unstructured data (PDFs, Word, Excel) and is now focusing on unlocking value for AI agents.

Mitlfi

An entity that worked with the Chroma team to build a shim for accessing Chroma data using file system semantics.

Locations

Seattle

The location where Carter Rabasa lives and where the CascadiaJS conference takes place.

Products

Box Drive

A cloud-based file storage solution that can be mounted by the OpenAI agent SDK, allowing agents to interact with it as a local file system.

Ask anything from this episode.

Save it, chat with it, and connect it to Claude or ChatGPT. Get cited answers from the actual content — and build your own knowledge base of every podcast and video you care about.

Get Started Free

AI Dev 26 x SF | Carter Rabasa: File Systems Are the New Primitive for AI Agents

Want to know something specific about what's covered?

Key Insights

The overlooked potential of file systems for AI agents

Why file systems are a natural fit for LLMs

File system semantics enable effective long-term memory

Emerging agent frameworks embrace file system paradigms

Demonstrating file system-based agent interaction

Beyond local files: Unix's broader influence

File systems as bridges for human and agent interaction

Riding the model's priors: the winning engineering principle

Mentioned in This Episode

Common Questions

Topics

Mentioned in this video

More from DeepLearningAI

AI Dev 26 x SF | Vlad Luzin: Herding Cats—The Hidden Challenges of Multi-Agent Autonomy

AI Dev 26 x SF | Melissa Herrera: Your Agents Should Be Durable

AI Dev 26 x SF | Paige Bailey: What's New and What's Next in AI

AI Dev 26 x SF | Aditi Gupta: Building SRE Agents with the Redis Context Engine

Ask anything from this episode.