How does Adept differentiate itself from other AI agent companies?

Adept prioritizes enterprise reliability and focuses on an augmentation model rather than fully automated solutions. They are not a developer company selling APIs or open-source models, but rather an enterprise-focused company delivering out-of-the-box solutions for complex workflows.

What is the significance of reliability for Adept's agents?

Reliability is critical, especially for enterprise use cases where even a 60% success rate is unacceptable. Adept's focus on reliability drives a different technological approach and ensures their agents can handle complex workflows, like dispatching physical assets.

How does Adept's approach to AI agents differ from a chatbot?

While chatbots are a common form factor, Adept believes agents should be capable of executing end-to-end workflows based on goals. Their focus is on enabling agents to interact with software like a human, going beyond simple conversational interfaces.

Why is the interaction layer between AI agents and software important?

Most software doesn't have APIs for every possible action. Adept's approach allows agents to control the user interface like a human, which is seen as a more practical path for widespread adoption and learning from user interactions.

Why should someone work at Adept instead of other AI companies?

Adept is focused on building useful agents, grounding research in actual customer needs and a reward signal from customers. They believe this approach accelerates AI development compared to pure research labs or companies focused solely on base model scaling.

What are the future trends in AI development according to David Luan?

David Luan believes the era of rapid industrialization in AI has begun, moving beyond just compute and data. He emphasizes the importance of deep co-design between products and users, and that companies focusing on practical, agent-based solutions will lead the way.

Key Moments

Why Google failed to make GPT-3 -- with David Luan of Adept

Q: What is the trade-off between reliability and generality in AI agents?

There's a perceived trade-off where increasing reliability might reduce generality. Adept aims to overcome this by framing the agent problem to benefit from data collection, allowing use cases to drive data generation for improved performance without sacrificing core capabilities.

Latent Space Podcast

Science & Technology5 min read50 min video

Mar 27, 2024|1,354 views|41|2

david luan adept ai agents latent space swyx alessio fanelli

Save to Pod

Key Moments

On this page

TL;DR

David Luan discusses AI agents, the evolution of AI research from OpenAI to Google, and Adept's focus on reliable enterprise AI agents.

Key Insights

AI agents are a natural evolution towards AGI, defined as AI capable of performing any human computer task.

Google's historical challenges in scaling LLMs were due to internal resource allocation issues (Brain Credit Marketplace).

Adept prioritizes reliability in AI agents, focusing on enterprise solutions over a developer-first API model.

The future of AI progress will be driven by the co-design of products, user feedback, and technology.

Multimodal models are becoming the default foundation model, with a shift towards training on knowledge work data (charts, tables, UIs) rather than just natural images.

Adept's strategy as an augmentation company fosters a data flywheel by incorporating human oversight and feedback.

FROM OPENAI TO GOOGLE BRAIN: A CAREER IN AI LEADERSHIP

David Luan recounts his early career, starting with Dextro, an AI company acquired by ExxonMobil. He then joined OpenAI as an early hire, leading engineering and influencing its technical direction towards large-scale bets rather than solely novel research. Later, at Google, he co-led Google Brain and led the company's LLM efforts, gaining insights into the different eras of AI research and the importance of resource allocation for massive projects.

THE SHIFT IN AI RESEARCH: FROM RESEARCH PAPERS TO PRODUCT CO-DESIGN

Luan categorizes AI research into distinct eras, marking 2017 as a pivotal year with the advent of Transformers. He observes that recent AI progress, particularly in the next few years, will be driven by the deep co-design and co-evolution of products and users. This feedback loop, combined with technological advancements, is crucial for labs and companies aiming for success, influencing his decision to found Adept.

AGENTS AS THE FUTURE: DEFINING AND BUILDING AI CAPABILITY

Luan views AI agents as the correct long-term direction for AGI, defining it not by human replacement but by AI's ability to outperform humans in economically valuable tasks. He emphasizes a more tractable definition: an AI model that can do anything a human can do on a computer. He also highlights the field's rediscovery of RL lessons, arguing that LLMs act as a powerful 'behavioral cloner' of human knowledge, paving the way for universal models.

THE CHALLENGE OF SCALING AT GOOGLE VERSUS OPENAI'S STRATEGY

Luan explains Google's internal challenges in scaling LLMs, particularly the 'Brain Credit Marketplace,' which hindered critical mass acquisition for large projects. In contrast, OpenAI's focused strategy on making 'giant swings and bets' and executing them at all costs proved more effective. This focus on scientific outcomes, regardless of novelty, became dominant, highlighting the importance of resource allocation and strategic direction.

ADEPT'S MISSION: RELIABLE AI AGENTS FOR ENTERPRISE WORKFLOWS

Adept aims to build AI agents that assist humans with any task on a computer, translating natural language goals into actionable steps across various software tools. Unlike many in the space, Adept is an 'enterprise company,' focusing on out-of-the-box solutions for complex workflows rather than selling APIs or open-source models. Reliability is paramount, distinguishing them from other agent demos that function less consistently.

PRIORITIZING RELIABILITY AND AUGMENTATION OVER AUTOMATION

Adept's commitment to 'nines of reliability' has led them down a different technological path, prioritizing dependability over speed or breadth in early stages. They see agents as augmenting human capabilities rather than fully replacing them, which creates a data flywheel. This approach allows them to learn from skilled humans and continuously improve AI by tackling tasks too complex for current models alone.

THE EVOLUTION OF MULTIMODAL MODELS AND KNOWLEDGE WORK

Luan discusses the development of multimodal models, emphasizing a shift from training on natural images to data relevant to knowledge work, such as charts, graphs, and UIs. This focus is crucial for agents operating in enterprise environments. Adept has invested heavily in building fast, multimodal models adept at understanding screens and dense OCR, forming the foundation for their reliable agent technology.

INTERACTION LAYERS: FROM GUIS TO GENERATIVE AGENT INTERFACES

The interaction layer for AI agents is evolving. Luan draws an analogy to the transition from command-line interfaces to GUIs. He posits that agents will control current GUIs and APIs, but future interfaces may become generative, offering a new standard interaction layer. This shift means software will increasingly be controlled by agents, with humans potentially interfacing less directly with underlying applications.

NAVIGATING THE TRADE-OFFS: RELIABILITY, GENERALITY, AND COST

Luan acknowledges the inherent trade-off between reliability and generality in AI development, especially at the frontier. Adept addresses this by framing agent problems to benefit from data collection, ensuring each use case contributes to broader learning rather than being overly prescriptive. This approach helps manage costs and speed while maintaining a high standard of performance, distinguishing them from companies solely focused on fine-tuning base models.

ADEPT'S ADVANTAGE: AGENT STACK OVER BASE MODELS

Adept believes its advantage lies not just in base models but in building a comprehensive agent stack through vertical integration. This allows for efficient allocation of resources, focusing on agents specifically trained for knowledge work tasks rather than general-purpose foundation models. They see pure-play foundation model companies being outcompeted by advancements in open-source models and massive compute investments from larger players.

THE FUTURE OF AI: COMPUTE, DATA, AND INDUSTRIALIZATION

Luan identifies compute and data as critical drivers of AI progress, with an increasing ability to trade one for the other. He believes the rapid industrialization era of AI has begun, necessitating an embrace of this shift. While acknowledging the importance of research, he stresses that practical application and deployment are becoming increasingly dominant factors in the field's advancement.

Mentioned in This Episode

●Products

●Software & Apps

●Companies

●Organizations

●Concepts

●People Referenced

Common Questions

Adept aims to build AI agents capable of performing any task a human does on a computer. They focus on turning natural language goals into actionable steps across various software tools, with the goal of providing an AI teammate to enhance productivity.

Topics

Ai Agents AI & Machine Learning Technology & Innovation Enterprise AI Large Language Models Human-computer Interaction AI Reliability Multimodal Models

Mentioned in this video

Companies

Dextro

A company founded by David Luan in college that developed the first real-time video detection classification API, which led to its acquisition by Exxon.

NVIDIA

A key hardware partner for AI development, working closely with OpenAI and Adept on GPU technology and co-design.

Microsoft

Invested in OpenAI and participated in crucial pitch meetings where the GPT-2 demo was presented.

Figure

An AI-affiliated company focused on robotics and embodied agents.

Inflection AI

A company focused on building AI agents, with Kaj Jun representing their perspective on working there versus Adept.

OpenAI

David Luan was an early hire at OpenAI, serving as VP of Engineering for over two years, and involved in leading large model initiatives.

Salesforce

Mentioned as an example of a system where an AI agent could perform tasks, potentially abstracting the direct user interface.

Organizations

Google Brain

David Luan was a lead at Google Brain and led Google's large language model (LLM) efforts.

People

Amy Hood

CFO of Microsoft, present during the pitch meeting with OpenAI.

David Luan

CEO and co-founder of Adept, with a background in AI research and development at companies like Dextro, Exxon, Google Brain, and OpenAI.

Noam Shazeer

A researcher at Google who discussed trillion-parameter models in 2017.

Sam Altman

CEO of OpenAI, involved in the Microsoft pitch meeting for OpenAI and mentioned as a proponent of agents being the future.

Kareem Met

Authored a New York Times piece on AI agents.

Amelia Wattenberg

Gave a talk at Adept's conference about alternatives to the chatbot interface for AI delegation.

Bill Gates

Stated that agents are the future in a blog post.

Ilya Sutskever

Co-founder of OpenAI, mentioned for his advocacy for data centers and large-scale AI.

Jensen Huang

CEO of NVIDIA, recognized for his legendary role and support in the early days of AI hardware development.

Alec Radford

Associated with early GPT work at OpenAI, known for his obsession with Transformers and applying them to sentiment analysis.

Kevin Scott

CTO of Microsoft, present during the pitch meeting with OpenAI.

Kaj Jun

From Inflection AI, discussed why people should work at Inflection AI instead of Adept.

Products

A00

A type of NVIDIA GPU generation mentioned in the context of co-design and its potential use for model training, specifically its quad sparsity feature.

Software & Apps

adept

A company co-founded by David Luan focused on building AI agents that can perform any task a human can on a computer, prioritizing reliability for enterprise use.

GPT-2

An earlier version of OpenAI's language model, discussed as a pivotal paper and a demonstration of focused research efforts.

Fuyu

An architecture developed by Adept focused on multimodal models optimized for knowledge work and understanding screens.

GPT-4 Vision

A multimodal model from OpenAI that can process visual information, relevant to building more capable AI agents.

GPT-3

A subsequent large language model from OpenAI, its scaling up presented a challenge and stress for research teams at Google due to resource competition.

Act One

An early demo released by Adept, showcasing an AI agent interacting with tools like Redfin and Google Sheets, serving as a 'hello world' for agent demos.

Transformers

A neural network architecture that revolutionized NLP and is fundamental to modern LLMs and AI agent development.

Llama

Open-source LLMs that are becoming increasingly capable, posing a challenge to pure-play foundation model companies.

Windows 3.1

Used as an analogy for the evolution of user interfaces, comparing the transition from command line to GUI to the future of AI agent interfaces.

Found this useful? Build your knowledge library

Get AI-powered summaries of any YouTube video, podcast, or article in seconds. Save them to your personal pods and access them anytime.

Get Started Free