What are OpenAI's key optimization goals for coding models?

OpenAI optimizes for more than just raw intelligence, focusing on user experience enhancements like providing context during tool calls (preambles) and improving token efficiency for lower latency and cost. They also aim for consistency over long trajectories and better agentic tool calling.

What aspects of agentic workflows are working well, and what remains brittle?

The ability to write and operate code over long trajectories is working well. However, inferring ambiguous user intent and precise tool selection remain more brittle areas that require careful tuning and development.

What is the next major unlock for AI agents like Codeex?

The next unlock is increasing trust in AI agents to perform end-to-end tasks autonomously for extended periods, potentially for hours. This 'set it and forget it' capability would allow humans to focus on higher-level interpretation and decision-making.

How can startup founders develop better 'taste' for what to build in the AI era?

Taste is developed through rapid iteration and learning from market feedback. Founders should prioritize getting products live quickly, adapting, and rebuilding, leveraging the faster development cycles enabled by AI tools.

Which industries are surprisingly fast adopters of AI?

Beyond software companies, the legal and healthcare sectors have shown remarkable speed in AI adoption. Both industries deal with vast amounts of unstructured data, require complex reasoning, and have highly paid professionals who can benefit from AI assistance.

What will the role of a software engineer look like in five years?

Software engineering will not disappear but will evolve. The abstraction layer is rising, potentially leading engineers to manage more tasks via agents. The definition may broaden to include anyone producing software, with a greater emphasis on building, improving, and maintaining complex systems.

Key Moments

AI Dev 26 x SF | A Fireside Chat with OpenAI's Marc Manara

DeepLearning.AI

Education5 min read25 min video

May 20, 2026|391 views|3

Save to Pod

Want to know something specific about what's covered?

We've already dissected every moment. Ask and we will deliver (with timestamps).

Key Moments

TL;DR

Software engineering isn't dying, but engineers will manage AI agents instead of writing basic code. This expands the definition of an "engineer" to anyone producing software, leading to an explosion of new creators.

Key Insights

OpenAI partners with startups like Cursor and Augment to help them optimize their use of new models, working with them on 'harnesses' that are trained alongside the models for better performance.

GPT-5.5 demonstrated a significant step-change in intelligence and token efficiency, reducing "thinking tokens" and output lengths to lower latency and cost without sacrificing quality.

The ability of AI to write code and operate over long trajectories (e.g., 30 minutes to hours) is becoming very good, but inferring ambiguous user intent and selecting the correct tools from a large pool remain more brittle areas.

The next big unlock is achieving greater trust in AI agents to run end-to-end for hours without stopping or making wrong decisions, enabling a "set it and forget it" workflow for developers.

Startup teams are becoming smaller, with an emphasis on hiring exceptional, cross-functional individuals who combine strong product sense with deep technical skills, and who embrace rapid iteration.

Software companies (like DoorDash, Canva, Figma) are moving fastest in AI adoption, but surprisingly, the legal and healthcare sectors are also showing rapid adoption due to unstructured data and reasoning needs.

OpenAI's collaborative approach with AI startups

OpenAI actively collaborates with startups building on their AI models, such as Cursor, Augment, and Cognition. Marc Manara's team focuses on helping these companies move beyond naive model usage to optimal performance tuning. This involves working closely with them, sometimes before a new model launches, to test and refine its capabilities. They help tailor the "harness" – the software layer that interacts with the model – to work synergistically with the model's training, creating a virtuous cycle where both improve. This partnership extends to solving practical challenges for startups, like reducing latency without sacrificing performance or handling ambiguous user instructions more effectively, ensuring end-users get the best possible experience.

Optimizing AI models for coding and user experience

OpenAI dedicates significant effort to making its models excel at coding, using benchmarks like Sweetbench Pro to measure raw intelligence. However, they recognize that end-user experience involves more than just accuracy. For instance, with GPT-5.5, a key focus was on "preambles" or "thinking tokens" – enabling agents to communicate their reasoning and trajectory to the user during complex, multi-tool operations. This enhances user understanding and trust. Another major optimization is token efficiency. GPT-5.5 achieved better intelligence with fewer tokens, leading to lower latency and cost without compromising output quality. This focus on efficiency is crucial for making AI tools responsive and affordable for widespread adoption.

Progress and brittleness in agentic AI workflows

AI's ability to write code and execute tasks over extended periods (minutes to hours) is rapidly improving, with capabilities now considered "very good." However, certain areas remain more "brittle." One such challenge is inferring ambiguous user intent, especially from non-technical users who provide simpler prompts. The AI must accurately guess the user's true goal and act on it without overstepping or underdelivering. Another brittle area is tool selection, where models need to intelligently choose the right tools from a vast pool, a problem OpenAI is addressing with features like "tool search." Successfully navigating these nuances is critical for building reliable AI agents that can handle complex operational tasks.

The quest for greater AI trust and autonomy

The "next big unlock" in AI, according to Manara, is achieving a higher level of trust in AI agents to operate autonomously over much longer durations, potentially for hours. Currently, developers might not fully trust an agent to complete an entire end-to-end task without intervention, stopping early, or making incorrect decisions. As AI agents become more capable of extended operation, the bottleneck is shifting from AI execution to human cognition – understanding and directing the numerous simultaneous tasks being performed. This "set it and forget it" capability is key to unlocking new levels of productivity and enabling developers to focus on higher-level strategy.

The shift from 'how to build' to 'what to build'

The landscape for startups has fundamentally changed from struggling with 'how to build' to focusing on 'what to build.' Manara, a former founder, emphasizes that the speed of iteration is now paramount. Unlike a decade ago, where building software was a slow slog, today's tools allow for extremely rapid cycles of development, market testing, and adaptation. This rapid iteration is no longer just an advantage but a necessity in a competitive environment. The value of "updating quickly," even if imperfect initially, is preferred over lengthy deliberation, enabling founders to learn from real-world usage and pivot swiftly.

The rise of smaller, cross-functional startup teams

Modern AI-native startups are observed to have remarkably small teams, often comprising only five to ten people, yet achieving significant revenue (tens of millions ARR). This lean structure aims to avoid the bureaucracy that hinders larger organizations. Hiring focuses on exceptional, "swarm" tactics to recruit individuals with a potent blend of strong product sense and deep technical skills. Roles are less siloed, with greater blending of product, engineering, and design. The ideal candidate can leverage AI tools for heavy lifting while possessing the technical understanding to guide and rein them in, spending more time on conceptualizing what to build and less on the granular coding aspect.

Overcoming speed bumps in AI adoption

While speed is a startup's primary advantage, challenges like code reviews, security, governance, and compliance can slow down adoption. Many startups address these "just-in-time" as needs arise. Larger, digital-native companies are surprisingly aggressive in adopting AI, often driven by top-down leadership buy-in and a culture of experimentation. They provide tools and freedom, encouraging workforce-wide AI integration to maintain pace with startups. While overcoming ingrained bureaucracy and legacy systems is tough for enterprises, innovative strategies like automated code reviews and human-in-the-loop processes can enable faster shipping, preventing them from losing ground to agile startups.

The future of the software engineer role

Manara firmly believes software engineering is not dying but is undergoing a significant transformation. The abstraction layer is moving upwards, shifting engineers from managing programming languages to managing multiple autonomous AI agents. This evolution expands the definition of an "engineer" to encompass anyone capable of producing software using these advanced tools. Consequently, instead of a decrease in demand, the author predicts an "explosion" of individuals creating and maintaining software. While traditional computer science education may need to adapt, the foundational understanding of software development will remain crucial for this burgeoning field.

Mentioned in This Episode

●Software & Apps

●Companies

●Organizations

●Studies Cited

●Concepts

●People Referenced

AI Native Startup Strategy & Team Building

Practical takeaways from this episode

Do This

Embrace rapid iteration and update quickly based on market feedback.

Hire exceptional, technically deep individuals with a product-oriented mindset, keeping teams small.

Blend product, engineering, and design roles for cross-functional agility.

Leverage AI tools to accelerate development and focus on 'what to build' rather than just 'how to build'.

Allow teams freedom to experiment with AI tools and build agents.

For enterprises, secure top-down leadership buy-in for AI adoption.

Automate processes like code reviews and ensure human oversight where critical.

Avoid This

Avoid getting bogged down by bureaucracy; keep teams lean.

Don't over-specify AI tasks; learn to infer and act on ambiguous user intent cautiously.

Don't neglect foundational technical skills, even when using AI tools.

Avoid rigid swim lanes between product, engineering, and design.

Don't be overly concerned with tool sprawl or early costs in the experimentation phase.

Don't let legacy tech/business debt or bureaucracy slow down adoption compared to startups.

Common Questions

OpenAI partners closely with companies to help them optimize the use of their AI models. This involves working with them before model launches, testing candidate models, and jointly developing parts of their technical 'harness' to ensure the best performance and user experience.

Topics

AI & Machine Learning Technology & Innovation Business & Entrepreneurship Agentic Workflows Startup Strategy AI Development LLM Optimization AI Adoption Trends Future Of Software Engineering AI In Enterprise

Mentioned in this video

People

Marc Manara

A speaker at the event, who leads teams at OpenAI focused on helping startups utilize their models. Previously, he worked at AWS, co-founded a food delivery startup, and was an engineer and investor.

Software & Apps

Liora

A company experiencing record-breaking growth in the legal space due to strong demand for AI solutions.

Abridge

A health system product designed to assist clinicians by saving them time, indicating rapid AI adoption in the healthcare sector.

Notion

A company noted for being on top of AI adoption and moving exceptionally fast, characteristic of digital-native scale-ups.

Canvas

A company identified as a fast-moving, digital-native scale-up that is aggressively adopting AI technologies.

Replit

A coding product that allows users, including non-technical ones, to interact with AI for coding tasks. It's mentioned in the context of inferring user intent.

HE Health

A product used within health systems to give clinicians back time, showing the fast pace of AI adoption in healthcare.

AWS

Amazon Web Services, where Marc Manara previously led startup teams.

Augment

A company that works with OpenAI, utilizing their models. It's an example of an AI-native company building on OpenAI's technology.

Ambiance

A product used in health systems to help clinicians save time, reflecting the quick adoption of AI in the healthcare industry.

Cursor

A company that works with OpenAI, using their models. It's mentioned as an example of an AI-native company building on OpenAI's technology.

GPT-5

A recently shipped model from OpenAI that demonstrates improvements in intelligence and token efficiency, focusing on fewer thinking tokens and shorter solution lengths.

Slack

A communication platform that Codeex can be used to navigate, as part of its broader knowledge work capabilities.

Harvey

A company noted for its record-breaking growth in the legal sector, driven by the strong adoption of AI tools.

Companies

Emergent

A coding product mentioned in the context of inferring user intent and handling ambiguous instructions, especially for non-technical users.

Figma

A company mentioned as a prime example of a fast-moving, digital-native scale-up that is aggressively adopting AI technologies.

Codeex

OpenAI's product that is used for coding tasks and broader knowledge work. It is described as an agentic coding product that can be trusted for longer trajectories.

Citadel Securities

A financial firm whose research suggests a rise in demand for software engineering roles, contrasting with claims of the role's demise.

OpenAI

An AI research and deployment company that builds models for various applications. The company partners with startups to help them optimize the use of their AI models.

Cognition

A company that partners with OpenAI, utilizing their models. Mentioned as an example of an AI-native company leveraging OpenAI's technology.

DoorDash

A company mentioned as an example of a fast-moving, digital-native scale-up that is aggressively adopting AI technologies.

Organizations

Acumen

An organization where Marc Manara worked as an investor.

UC Berkeley

University where the host teaches entrepreneurship, emphasizing the 'you are not your user' principle.

Studies & Research

Terminal Bench

A benchmark used to measure progress in raw intelligence for coding tasks, assessing capabilities relevant to software engineers.

Sweetbench

A benchmark used to assess the raw intelligence of AI models, particularly for software engineering tasks like writing, reviewing, and refactoring code.

Concepts

Pareto frontier

A concept used to describe the trade-off between token efficiency and accuracy in AI models, aiming for high accuracy with minimal tokens.

Ask anything from this episode.

Save it, chat with it, and connect it to Claude or ChatGPT. Get cited answers from the actual content — and build your own knowledge base of every podcast and video you care about.

Get Started Free