Key Moments

AI Dev 26 x SF | A Fireside Chat with OpenAI's Marc Manara

DeepLearning.AIDeepLearning.AI
Education5 min read25 min video
May 20, 2026|137 views|1
Save to Pod

Want to know something specific about what's covered?

We've already dissected every moment. Ask and we will deliver (with timestamps).

TL;DR

Software engineering isn't dying, but engineers will manage AI agents instead of writing basic code. This expands the definition of an "engineer" to anyone producing software, leading to an explosion of new creators.

Key Insights

1

OpenAI partners with startups like Cursor and Augment to help them optimize their use of new models, working with them on 'harnesses' that are trained alongside the models for better performance.

2

GPT-5.5 demonstrated a significant step-change in intelligence and token efficiency, reducing "thinking tokens" and output lengths to lower latency and cost without sacrificing quality.

3

The ability of AI to write code and operate over long trajectories (e.g., 30 minutes to hours) is becoming very good, but inferring ambiguous user intent and selecting the correct tools from a large pool remain more brittle areas.

4

The next big unlock is achieving greater trust in AI agents to run end-to-end for hours without stopping or making wrong decisions, enabling a "set it and forget it" workflow for developers.

5

Startup teams are becoming smaller, with an emphasis on hiring exceptional, cross-functional individuals who combine strong product sense with deep technical skills, and who embrace rapid iteration.

6

Software companies (like DoorDash, Canva, Figma) are moving fastest in AI adoption, but surprisingly, the legal and healthcare sectors are also showing rapid adoption due to unstructured data and reasoning needs.

OpenAI's collaborative approach with AI startups

OpenAI actively collaborates with startups building on their AI models, such as Cursor, Augment, and Cognition. Marc Manara's team focuses on helping these companies move beyond naive model usage to optimal performance tuning. This involves working closely with them, sometimes before a new model launches, to test and refine its capabilities. They help tailor the "harness" – the software layer that interacts with the model – to work synergistically with the model's training, creating a virtuous cycle where both improve. This partnership extends to solving practical challenges for startups, like reducing latency without sacrificing performance or handling ambiguous user instructions more effectively, ensuring end-users get the best possible experience.

Optimizing AI models for coding and user experience

OpenAI dedicates significant effort to making its models excel at coding, using benchmarks like Sweetbench Pro to measure raw intelligence. However, they recognize that end-user experience involves more than just accuracy. For instance, with GPT-5.5, a key focus was on "preambles" or "thinking tokens" – enabling agents to communicate their reasoning and trajectory to the user during complex, multi-tool operations. This enhances user understanding and trust. Another major optimization is token efficiency. GPT-5.5 achieved better intelligence with fewer tokens, leading to lower latency and cost without compromising output quality. This focus on efficiency is crucial for making AI tools responsive and affordable for widespread adoption.

Progress and brittleness in agentic AI workflows

AI's ability to write code and execute tasks over extended periods (minutes to hours) is rapidly improving, with capabilities now considered "very good." However, certain areas remain more "brittle." One such challenge is inferring ambiguous user intent, especially from non-technical users who provide simpler prompts. The AI must accurately guess the user's true goal and act on it without overstepping or underdelivering. Another brittle area is tool selection, where models need to intelligently choose the right tools from a vast pool, a problem OpenAI is addressing with features like "tool search." Successfully navigating these nuances is critical for building reliable AI agents that can handle complex operational tasks.

The quest for greater AI trust and autonomy

The "next big unlock" in AI, according to Manara, is achieving a higher level of trust in AI agents to operate autonomously over much longer durations, potentially for hours. Currently, developers might not fully trust an agent to complete an entire end-to-end task without intervention, stopping early, or making incorrect decisions. As AI agents become more capable of extended operation, the bottleneck is shifting from AI execution to human cognition – understanding and directing the numerous simultaneous tasks being performed. This "set it and forget it" capability is key to unlocking new levels of productivity and enabling developers to focus on higher-level strategy.

The shift from 'how to build' to 'what to build'

The landscape for startups has fundamentally changed from struggling with 'how to build' to focusing on 'what to build.' Manara, a former founder, emphasizes that the speed of iteration is now paramount. Unlike a decade ago, where building software was a slow slog, today's tools allow for extremely rapid cycles of development, market testing, and adaptation. This rapid iteration is no longer just an advantage but a necessity in a competitive environment. The value of "updating quickly," even if imperfect initially, is preferred over lengthy deliberation, enabling founders to learn from real-world usage and pivot swiftly.

The rise of smaller, cross-functional startup teams

Modern AI-native startups are observed to have remarkably small teams, often comprising only five to ten people, yet achieving significant revenue (tens of millions ARR). This lean structure aims to avoid the bureaucracy that hinders larger organizations. Hiring focuses on exceptional, "swarm" tactics to recruit individuals with a potent blend of strong product sense and deep technical skills. Roles are less siloed, with greater blending of product, engineering, and design. The ideal candidate can leverage AI tools for heavy lifting while possessing the technical understanding to guide and rein them in, spending more time on conceptualizing what to build and less on the granular coding aspect.

Overcoming speed bumps in AI adoption

While speed is a startup's primary advantage, challenges like code reviews, security, governance, and compliance can slow down adoption. Many startups address these "just-in-time" as needs arise. Larger, digital-native companies are surprisingly aggressive in adopting AI, often driven by top-down leadership buy-in and a culture of experimentation. They provide tools and freedom, encouraging workforce-wide AI integration to maintain pace with startups. While overcoming ingrained bureaucracy and legacy systems is tough for enterprises, innovative strategies like automated code reviews and human-in-the-loop processes can enable faster shipping, preventing them from losing ground to agile startups.

The future of the software engineer role

Manara firmly believes software engineering is not dying but is undergoing a significant transformation. The abstraction layer is moving upwards, shifting engineers from managing programming languages to managing multiple autonomous AI agents. This evolution expands the definition of an "engineer" to encompass anyone capable of producing software using these advanced tools. Consequently, instead of a decrease in demand, the author predicts an "explosion" of individuals creating and maintaining software. While traditional computer science education may need to adapt, the foundational understanding of software development will remain crucial for this burgeoning field.

AI Native Startup Strategy & Team Building

Practical takeaways from this episode

Do This

Embrace rapid iteration and update quickly based on market feedback.
Hire exceptional, technically deep individuals with a product-oriented mindset, keeping teams small.
Blend product, engineering, and design roles for cross-functional agility.
Leverage AI tools to accelerate development and focus on 'what to build' rather than just 'how to build'.
Allow teams freedom to experiment with AI tools and build agents.
For enterprises, secure top-down leadership buy-in for AI adoption.
Automate processes like code reviews and ensure human oversight where critical.

Avoid This

Avoid getting bogged down by bureaucracy; keep teams lean.
Don't over-specify AI tasks; learn to infer and act on ambiguous user intent cautiously.
Don't neglect foundational technical skills, even when using AI tools.
Avoid rigid swim lanes between product, engineering, and design.
Don't be overly concerned with tool sprawl or early costs in the experimentation phase.
Don't let legacy tech/business debt or bureaucracy slow down adoption compared to startups.

Common Questions

OpenAI partners closely with companies to help them optimize the use of their AI models. This involves working with them before model launches, testing candidate models, and jointly developing parts of their technical 'harness' to ensure the best performance and user experience.

Topics

Mentioned in this video

Software & Apps
Liora

A company experiencing record-breaking growth in the legal space due to strong demand for AI solutions.

Abridge

A health system product designed to assist clinicians by saving them time, indicating rapid AI adoption in the healthcare sector.

Notion

A company noted for being on top of AI adoption and moving exceptionally fast, characteristic of digital-native scale-ups.

Canvas

A company identified as a fast-moving, digital-native scale-up that is aggressively adopting AI technologies.

Replit

A coding product that allows users, including non-technical ones, to interact with AI for coding tasks. It's mentioned in the context of inferring user intent.

HE Health

A product used within health systems to give clinicians back time, showing the fast pace of AI adoption in healthcare.

AWS

Amazon Web Services, where Marc Manara previously led startup teams.

Augment

A company that works with OpenAI, utilizing their models. It's an example of an AI-native company building on OpenAI's technology.

Ambiance

A product used in health systems to help clinicians save time, reflecting the quick adoption of AI in the healthcare industry.

Cursor

A company that works with OpenAI, using their models. It's mentioned as an example of an AI-native company building on OpenAI's technology.

GPT-5

A recently shipped model from OpenAI that demonstrates improvements in intelligence and token efficiency, focusing on fewer thinking tokens and shorter solution lengths.

Slack

A communication platform that Codeex can be used to navigate, as part of its broader knowledge work capabilities.

Harvey

A company noted for its record-breaking growth in the legal sector, driven by the strong adoption of AI tools.

More from DeepLearningAI

View all 80 summaries

Ask anything from this episode.

Save it, chat with it, and connect it to Claude or ChatGPT. Get cited answers from the actual content — and build your own knowledge base of every podcast and video you care about.

Get Started Free