Key Moments

Agents @ Work: Dust.tt — with Stanislas Polu

Latent Space PodcastLatent Space Podcast
Science & Technology4 min read59 min video
Nov 11, 2024|2,696 views|34|4
Save to Pod
TL;DR

Dust.tt's Stanislas Polu discusses AI agents, OpenAI, and building horizontal platforms.

Key Insights

1

Stanislas Polu transitioned from Stripe and OpenAI to found Dust.tt with a focus on building a horizontal AI agent platform.

2

His work at OpenAI focused on math and AI, exploring large language models' reasoning capabilities in formal mathematics.

3

Dust.tt aims to provide infrastructure for companies to deploy AI agents for their teams, emphasizing a product-driven approach over research.

4

The platform prioritizes enabling non-developers to create agents through natural language instructions, contrasting with vertical agent solutions.

5

Polu believes in the power of horizontal platforms to unlock emergent use cases and emphasizes API integrations over UI automation for internal company systems.

6

Dust.tt leverages tools like Temporal for orchestration and focuses on making AI accessible and non-intimidating to end-users.

JOURNEY FROM STRIPE AND OPENAI TO FOUNDING DUST.TT

Stanislas Polu shared his career path, which included roles at Oracle, Stripe, and OpenAI before he co-founded Dust.tt. His experience at Stripe, known for its engineering culture, and his time at OpenAI, particularly working on math and AI research, provided a strong foundation. Polu was part of the early wave of "OpenAI Alumnus Founders," leaving in September 2022, before the release of ChatGPT. This background shaped his vision for Dust.tt, moving from advanced research to building practical AI tools.

EXPLORING AI RESEARCH AT OPENAI

At OpenAI, Polu was dedicated to the "math and AI" research area, focusing on large language models' (LLMs) ability to perform formal mathematical reasoning. The goal was to combine the creativity of LLMs with the verification capabilities of formal systems, a challenging but promising endeavor. He noted that while Transformers are creative, they make mistakes, and formal math systems provide mechanical verification. This work was aligned with OpenAI's mission and received the necessary compute resources for progress.

THE BIRTH OF DUST.TT: A HORIZONTAL AGENTS PLATFORM

Polu's motivation for starting Dust.tt stemmed from the realization that immense value could be unlocked by productizing LLMs, particularly GPT-4, which he saw internally before its public release. He observed a gap between the models' capabilities and their deployment. Dust.tt was conceived as a horizontal agents platform, aiming to build infrastructure that enables companies to deploy AI agents within their teams. The core idea is to empower non-developers to create agents that solve operational tasks.

PRODUCT-DRIVEN DEVELOPMENT AND ACCESSIBILITY

Dust.tt's strategy is deliberately product-focused, eschewing its own research and model training to leverage existing frontier models. The platform prioritizes user experience, especially for non-technical individuals. Polu emphasized the importance of making the technology feel accessible and non-intimidating, using user-friendly language like "instructions" instead of "system prompts." This approach contrasts with deep research efforts and focuses on practical deployment and value generation.

INFRASTRUCTURE, CONNECTORS, AND ORCHESTRATION

A significant part of Dust.tt's work involves building robust infrastructure, including connections to various data sources and tools. This "boring infrastructure work" is crucial for enabling agents to access data and take actions. The platform uses tools like Temporal for asynchronous work and orchestration, ensuring that agents can handle complex workflows. Polu also discussed the challenge of integrating with diverse company systems, prioritizing API-based interactions over UI automation where possible.

THE HORIZONTAL VS. VERTICAL AGENTS DEBATE

Polu strongly advocates for a horizontal approach to AI agents, believing it unlocks emergent use cases across different teams and functions within a company. While vertical solutions might have easier go-to-market strategies, they often have limited potential. Dust.tt's horizontal platform aims to provide a general-purpose tool that can be adapted to various needs, allowing for more widespread adoption and value creation within an organization by empowering users to build their own solutions.

FUNCTION CALLING AND MODEL AGNOSTICISM

Dust.tt places a high emphasis on function calling, recognizing its critical role in agent performance. The platform supports multiple LLMs, allowing users to choose the model that best suits their needs, though a default is provided for ease of use. Polu noted that while models are improving, the core value lies in precise instructions and the ability for agents to correctly interpret and execute function calls, enabling more complex workflows and meta-agents.

THE FUTURE OF SAS AND ENTERPRISE UTILIZATION

Polu speculated on the future of Software as a Service (SaaS) in the age of AI, suggesting that highly capable AI agents might reduce the need for some traditional SaaS products, especially within tech companies that can build their own solutions. He also discussed the concept of "maximum enterprise utilization," where AI agents help companies tackle work that was previously impossible due to a lack of human resources, potentially leading to smaller, more efficient teams achieving significant output.

Common Questions

Dust.tt aims to build the infrastructure for companies to deploy AI agents within their teams. It focuses on providing tools for non-developers to create agents for operational tasks, bridging the gap between complex AI capabilities and everyday business needs.

Topics

Mentioned in this video

Software & Apps
Dust.tt

A company founded by Stanislas Polu, focused on building infrastructure for AI agents.

Lean

A formal proof system mentioned in the context of AI and mathematics research.

GPT-3

A key product from OpenAI that received significant compute resources.

XLM

A large action model from Salesforce mentioned on a leader board for function calling.

GPT-2

Mentioned as a previous generation of models in the research path at OpenAI.

GPT-4

A major model from OpenAI, its capabilities were recognized by Stanislas Polu as creating significant value.

WebGPT

An early success from OpenAI that used models to traverse and summarize the web.

LangChain

A competitor to Dust.tt in the developer tooling space for AI.

Auto-GPT

An example of a highly autonomous AI agent approach, contrasted with Dust's more controlled method.

Slack

An example of a system with good API integration for AI agents.

Claude 3

A model family from Anthropic known for its function calling capabilities and chain-of-thought steps.

ChatGPT

Mentioned as a product of OpenAI that influenced the talent flow to the company.

XP1

A browser extension developed as an early product from Dust.tt.

OpenAI GPT-4 Turbo

A specific model variant discussed for its function calling capabilities.

Mini

A model ranked on a function calling leader board.

Next.js

A popular JavaScript framework used for building web applications, mentioned in the context of Dust.tt's development stack.

Notion

A productivity tool for which Dust.tt ensures structured data capture for AI models.

Compose

Mentioned as a company providing integrations.

Remix

A web framework that ChatGPT reportedly rewrote from Next.js into.

ffmpeg

A multimedia framework that could be wrapped with GPT for a video editing agent.

Jasper

Mentioned as an example of a company already selling AI products to enterprises.

adept

A company building AI agents, discussed in comparison to Dust's approach.

Sonnet

A specific model within the Claude 3 family, noted for its function calling innovation.

AWS

The successful product spin-off of Amazon, highlighting a platform strategy.

All Hands

Mentioned as a company providing integrations.

Perplexity

A company that spun off its internal search technology into a separate product, illustrating a product-over-platform strategy.

More from Latent Space

View all 145 summaries

Found this useful? Build your knowledge library

Get AI-powered summaries of any YouTube video, podcast, or article in seconds. Save them to your personal pods and access them anytime.

Try Summify free