The AI Coding Factory
Key Moments
Factory.ai builds autonomous software engineering droids for enterprises, handling everything from code generation to incident response.
Key Insights
Factory.ai offers autonomous software engineering droids for enterprises, focusing on full SDLC automation.
The company targets underserved enterprise needs, particularly with legacy codebases, differentiating from tools for solo developers.
Factory's 'droids' act as autonomous agents, capable of planning, decision-making, and environmental grounding for complex tasks.
The platform emphasizes a 'delegation' workflow over collaboration, moving beyond traditional IDE constraints.
Factory uses a usage-based pricing model, providing direct token usage transparency to users.
The company believes developer roles will shift away from writing code full-time toward planning, understanding, and testing AI-generated code.
FOUNDING AND VISION
Factory.ai founders Eno Reyes and Matan Grinberg met at a hackathon, bonding over their shared obsession with AI for software development. Despite initial limitations in AI models, they recognized the potential for autonomous agents in software engineering. Their vision extends beyond simple code generation, aiming to automate the entire software development lifecycle, particularly for large enterprises. This focus stems from a desire to tackle complex, real-world problems in often under-served segments of the market.
TARGETING THE ENTERPRISE MARKET
Unlike many AI coding tools focused on solo developers or rapid prototyping, Factory.ai has strategically targeted the enterprise market. This involves addressing the complexities of large, often decades-old codebases found in enterprises. While these use cases may not be as visually appealing as quick demos, the potential value and impact on developer productivity are significant. The company believes this underserved segment presents a massive opportunity for impactful automation.
THE CONCEPT OF 'DROIDS' AND AUTONOMOUS AGENTS
Factory.ai refers to its autonomous software engineering systems as 'droids,' distinguishing them from the more common, sometimes unreliable 'agent' terminology that implies endless, unguided loops. These droids are designed to be more robust, guided by planning, decision-making, and environmental grounding. They can maintain goal-oriented behavior over extended periods without hard-coded constraints, making them suitable for complex, multi-step software development tasks, including incident response.
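The droid behavior described above — plan first, execute step by step, and ground each step in observed environment state rather than an unguided loop — can be sketched in a few lines. This is a minimal illustration; the class and method names are hypothetical and not Factory.ai's actual API.

```python
from dataclasses import dataclass, field

@dataclass
class Droid:
    """Toy goal-directed agent loop: plan, then execute with grounding.
    Purely illustrative; names are invented, not Factory.ai's API."""
    goal: str
    plan: list = field(default_factory=list)
    done: list = field(default_factory=list)

    def make_plan(self):
        # A real droid would call an LLM here; we stub a fixed 3-step plan.
        self.plan = [f"step {i} toward: {self.goal}" for i in range(1, 4)]

    def observe(self, step):
        # Environmental grounding: check real state (tests, CI, repo)
        # instead of trusting the model's own claims about what happened.
        return f"observed result of {step}"

    def run(self, max_steps=10):
        # Goal-oriented behavior without hard-coded constraints: keep going
        # until the plan is exhausted or a step budget is hit.
        self.make_plan()
        while self.plan and len(self.done) < max_steps:
            step = self.plan.pop(0)
            self.done.append(self.observe(step))
        return self.done

droid = Droid(goal="fix flaky login test")
results = droid.run()
```

The key contrast with an "endless, unguided loop" is that each iteration consumes a planned step and records a grounded observation, so progress is bounded and auditable.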
REIMAGINING THE SOFTWARE DEVELOPMENT WORKFLOW
Factory.ai's platform challenges the traditional developer workflow by moving from a collaborative model to a more delegative one. This shift is enabled by developing an infrastructure that is not confined by the constraints of legacy IDEs, which were designed for human-centric code writing. By freeing themselves from latency and cost limitations inherent in IDE-based tools, Factory.ai can rethink the optimal user interface and interaction patterns for an AI-driven software development future where human coding output is significantly reduced.
KEY USE CASES AND PLATFORM DEMONSTRATION
The Factory.ai platform offers specialized 'droids' for key use cases, including knowledge and technical writing, code generation and modification, and incident response. During a demonstration, a code droid was tasked with a ticket, showcasing its ability to perform semantic searches, access integrated tools (like GitHub and Jira), and develop a plan. The user interaction involves providing clarifications and preferences, allowing the droid to execute tasks, with clear visibility into its activity log and context panel.
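The ticket-to-execution flow just described — gather context via semantic search, form a plan, pause for user clarification, then execute with a visible activity log — can be sketched as follows. All function names here are illustrative stand-ins, not Factory.ai's real interfaces.

```python
def run_ticket(ticket, search, ask_user):
    """Hypothetical sketch of the delegation flow: ticket in, context
    gathered, plan confirmed with the user, steps executed and logged."""
    log = []
    context = search(ticket)                 # stand-in for semantic search over repo/tools
    log.append(("context", context))
    plan = [f"modify {f}" for f in context]  # stand-in for LLM planning
    log.append(("plan", plan))
    if ask_user(f"Proceed with {len(plan)} steps?"):  # clarification round
        for step in plan:
            log.append(("exec", step))       # every action lands in the activity log
    return log

# Example run with stubbed search and an auto-approving user:
log = run_ticket(
    "JIRA-123: fix auth bug",
    search=lambda t: ["auth.py", "session.py"],
    ask_user=lambda q: True,
)
```

Keeping every phase in a single ordered log is what gives the user the "clear visibility" the demo emphasizes: context, plan, and execution are inspectable after the fact.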
INTEGRATIONS AND PROACTIVE INSIGHTS
Factory.ai's droids integrate with a wide array of enterprise tools such as Linear, Jira, Slack, GitHub, and PagerDuty. Crucially, the platform aims to be proactive by synthesizing 'synthetic insights' on codebases, including setup instructions and module connections, rather than solely relying on reactive information requests. This approach provides developers with essential context, mimicking how a human engineer would be onboarded with access to all necessary information sources.
PRICING, EFFICIENCY, AND METRICS
The company employs a usage-based pricing model directly tied to token consumption, emphasizing transparency over abstract credits. Factory.ai prioritizes token efficiency, particularly in retrieval mechanisms, to manage costs effectively, even with large codebases. Success is measured not just by raw usage but by tangible deliverables like pull requests created and code merged. They also analyze metrics like code churn for enterprise clients to identify potential quality issues in AI-generated code.
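Two of the ideas in this section are easy to make concrete: billing derived directly from token counts (no abstract credits), and code churn as a quality signal. The sketch below uses made-up prices and numbers purely for illustration.

```python
def invoice(token_events, price_per_mtok):
    """Usage-based billing tied directly to token counts.
    Prices and event shapes here are invented for illustration."""
    total = sum(n for _, n in token_events)
    return total, total / 1_000_000 * price_per_mtok

def churn_rate(lines_added, lines_deleted_soon_after):
    # Code churn: fraction of freshly added lines deleted shortly after
    # merging -- a rough proxy for quality issues in AI-generated code.
    return lines_deleted_soon_after / lines_added if lines_added else 0.0

# Example: 800k input + 200k output tokens at a hypothetical $3 per million.
tokens, cost = invoice([("input", 800_000), ("output", 200_000)], price_per_mtok=3.0)
# Example: 75 of 500 AI-written lines were deleted within the review window.
churn = churn_rate(lines_added=500, lines_deleted_soon_after=75)
```

Because cost is a direct function of tokens, retrieval efficiency (fewer tokens pulled per task) translates straight into lower bills, which is why the company treats token-efficient retrieval as a first-class concern.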
THE FUTURE OF SOFTWARE DEVELOPMENT AND AI
The founders believe the role of a software developer will dramatically change, with a decrease in lines of code written by humans and an increase in time spent on planning, understanding, and testing AI-generated code. True test-driven development is expected to flourish in this AI-driven environment. The platform's browser-based interface is a departure from IDEs, representing a fundamental rethinking of the optimal UI for a future where delegation to AI is paramount.
MODEL EVALUATION AND CONTINUOUS IMPROVEMENT
Factory.ai continuously evaluates new AI models by focusing on desired versus actual behavior, using a combination of task-based benchmarks and high-level behavioral principles. They are developing internal benchmarks and adapting to new model capabilities, such as those with enhanced reasoning or longer context windows. A significant challenge is ensuring that their tools effectively utilize the advanced capabilities of new models and combat any inherent preferences introduced by reinforcement learning.
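The "desired versus actual behavior" framing maps naturally onto a task-based eval harness: each task pairs a prompt with a behavioral check on the output, rather than an exact string match. This is a toy sketch, not Factory.ai's internal benchmark suite.

```python
def evaluate(model, tasks):
    """Toy harness: run each task through the model and score the output
    against a behavioral check. 'model' is any callable prompt -> text."""
    results = []
    for prompt, check in tasks:
        output = model(prompt)
        results.append(check(output))  # behavioral predicate, not string equality
    return sum(results) / len(results)

# Hypothetical tasks with simple behavioral predicates:
tasks = [
    ("add type hints to f", lambda out: "->" in out),
    ("write a unit test",   lambda out: "assert" in out),
]
# A stub "model" that always emits the same annotated signature:
score = evaluate(lambda p: "def f(x: int) -> int: ...", tasks)
```

Swapping in a new model is just swapping the callable, which is what makes this style of harness useful for continuously comparing model releases against the same behavioral principles.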
LIMITING FACTORS AND FUTURE DEVELOPMENT
Current limitations include the need for models with longer-term directed behavior over extended periods (hours) and the challenge of semantic observability in enterprise environments where code data is inaccessible. While not currently planning extensive model customization, they are building benchmarks for post-training techniques. The company is actively hiring, particularly for roles that bridge technical expertise with client-facing sales and success, aiming to grow their go-to-market strategy for large enterprises.
Common Questions
What does Factory AI build, and how is it different from other AI coding tools?
Factory AI builds autonomous systems for the end-to-end software development lifecycle, specifically for enterprises. Unlike tools focused on solo developers or code generation, Factory AI targets complex legacy codebases and offers a more delegative workflow, aiming to fundamentally reimagine the developer platform beyond the traditional IDE.
Mentioned in this video
Company founded by co-host Swyx.
Company where Alessio is Partner and CTO.
Company founded by Matan and Eno, focused on autonomous systems for the software development lifecycle.
An IDE that has its own rule parsing, which Factory AI's system can also ingest.
A fantastic observability tool for LLMs, used by Factory AI.
A language model mentioned in the context of users wanting longer context windows.
A project management tool integrated with Factory AI, used for creating tickets and epics for migration tasks.
A European nation mentioned as an example of a large public company with a hospital system needing modernization.
An incident response platform integrated with Factory AI.
A company with whom Factory AI previously discussed inference speed requirements.
A language model available at the time, considered insufficient for a fully autonomous engineering agent.
A framework mentioned as not being used by Factory AI, though they use LangSmith.
Research papers establishing the predictable improvement in LLM capabilities with increased scale.
A platform integrated with Factory AI for code management and pull request creation.
A project management tool integrated with Factory AI.
A tool company that may be closer to providing semantic observability.
A monitoring and analytics platform mentioned as an essential information source for engineers.
A benchmark used for evaluating LLMs, which Factory AI no longer competes on due to its irrelevance to enterprise use cases.
A communication platform integrated with Factory AI, used for sharing information and updates.
A webcomic referenced for a joke about everyone having a blank rules file.
Provider of the operator benchmark, which involved human testers attempting to work on hard problems for extended periods.
A command-line utility for searching plain-text data sets, used as a comparison for model preferences.
Company where some of the Factory AI team previously worked and gained experience in observability.
A reasoning technique for LLMs discussed in papers, contributing to the observed improvement in model reasoning.
Quoted for the analogy of iterating from horses to cars to explain the need for a new approach to software development.