How does the new Apps SDK enhance user experience with ChatGPT integrations?

The Apps SDK shifts the paradigm from embedding chatbots into websites to having ChatGPT as the primary interface, with applications and services embedded within it. This creates a more integrated and natural user experience, similar to how Canva was integrated during the keynote.

What is Agent Kit and what can developers build with it?

Agent Kit is a comprehensive set of solutions for building, deploying, and optimizing AI agents. It includes components like Agent Builder, Connector Registry, Track Kit, and Eval, allowing developers to create complex agent workflows, integrate tools, and manage production deployment.

How does Agent Builder facilitate agent development?

Agent Builder serves as a playground for modeling and iterating on agent systems, allowing prompt optimization and testing. It also offers a path to deploying agents without needing to manage the infrastructure, with features like visual nodes and templates for common use cases.

What is OpenAI's stance on interoperability between different agent builder platforms?

OpenAI is exploring interoperability, particularly for the Agent SDK. They are assessing how to standardize elements like stateful API responses and agent workflows to allow for greater portability across different models and platforms.

How does OpenAI handle evaluations (evals) for agents, and what's next?

The Evals product now supports agent evaluations by utilizing traces generated by the Agent SDK. While still a work in progress, the focus is on allowing detailed measurement of individual trace components and incorporating human-in-the-loop feedback for more comprehensive assessments.

What are the key benefits of using Codex for developers?

Codex acts as a powerful AI assistant for developers, significantly speeding up workflows. Tips include trusting the model to handle larger tasks and features, leveraging its capabilities for code generation, and utilizing its high-quality PR reviews to improve code quality and context switching.

What is the Service Health Dashboard and why is it important?

The Service Health Dashboard provides real-time personal SLOs for an organization's integration with OpenAI's API. It tracks token velocity, response codes, and throughput, offering insights into integration health and supporting OpenAI's ongoing efforts to improve reliability.

Key Moments

DevDay 2025: Apps SDK, Agent Kit, MCP, Codex and why Prompting is More Important than Ever

Latent Space Podcast

Science & Technology5 min read45 min video

Oct 7, 2025|5,361 views|98|4

Save to Pod

Key Moments

On this page

TL;DR

OpenAI DevDay introduces Agent Kit, Apps SDK, and MCP, emphasizing agent building, app integration within chat, and enhanced developer tools. Prompting remains crucial.

Key Insights

OpenAI is enhancing developer tools with Agent Kit for streamlined agent creation and deployment, and the Apps SDK for integrating applications directly within chat interfaces.

The MCP protocol, adopted by OpenAI, is highlighted as a valuable open standard for tool integration, fostering interoperability and simplifying agent development.

Agent Builder provides a visual, user-friendly interface for creating and iterating on AI agents, supporting complex workflows and human-in-the-loop interactions.

Prompt optimization is more critical than ever, with ongoing research and development focusing on automated prompt tuning and embedding these capabilities into the platform.

OpenAI is prioritizing reliability and transparency with the new Service Health Dashboard, aimed at giving developers real-time insights into their API integration performance.

The company is exploring ways to make inference costs more manageable for developers, with 'bring your own key' options being a discussed future possibility.

EVOLUTION OF DEVELOPER ENGAGEMENT AND PLATFORM GROWTH

OpenAI's DevDay 2025 saw a notable increase in developer engagement, with 4 million registered developers, up from the previous year. This growth reflects the company's ongoing commitment to empowering third-party developers to bring AGI benefits to the world. The event's improved organization, including dedicated podcast studios, signifies a maturing approach to developer outreach, building upon feedback from previous iterations like GPTs and plugins. The focus remains on iteratively releasing tools that foster developer innovation and expand the reach of AI technologies.

APPS SDK AND THE INVERTED PARADIGM OF CHAT INTEGRATION

The Apps SDK represents a significant shift by embedding applications directly within ChatGPT, inverting the traditional model of separate chatbots on websites. This allows for a more integrated and context-aware user experience, as demonstrated with integrations like Canva. This approach, learned from previous efforts like plugins, emphasizes giving developers more control over their brand and user experience, making the integration feel seamless and native within the ChatGPT environment.

AGENT KIT: COMPREHENSIVE TOOLS FOR BUILDING AI AGENTS

Agent Kit is introduced as a full suite of solutions designed to simplify the process of building, deploying, and optimizing AI agents. It addresses the complexities builders face with prompt engineering, optimization, and evaluation. Key components include the Agent SDK for programmatic control, the Agent Builder for visual workflow creation, a Connector Registry for tool integration (leveraging MCP), Trackit for tracing and debugging, and Evals for performance assessment, offering an end-to-end system for agent development.

MCP PROTOCOL AND INTEROPERABILITY AMONG TOOLS

OpenAI's adoption of the Message Passing Protocol (MCP) highlights its commitment to open standards for tool integration within agents. Developed by Anthropic and treated as an open protocol by the community, MCP simplifies connecting various tools and services. This interoperability is crucial for building sophisticated agents, allowing developers to leverage existing ecosystems and promoting a standardized approach to tool access, thereby reducing development friction and encouraging broader adoption.

AGENT BUILDER: VISUAL WORKFLOWS AND FLEXIBLE DEPLOYMENT

The Agent Builder offers a visual, canvas-like interface for designing AI agent workflows, making complex processes more accessible. It supports features like user approval nodes and state management, enabling near Turing-complete functionality. Developers can use it as a playground for iteration and prompt optimization, then export their work to the Agent SDK, or leverage OpenAI's deployment services. The builder also benefits from pre-built templates for common use cases like customer support and data enrichment.

ENHANCING EVALUATION AND PROMPT OPTIMIZATION

The Evals product has been expanded to support agent evaluations, allowing for the assessment of longer, more complex agent traces. This includes breaking down traces for granular analysis and incorporating human-in-the-loop feedback. Automated prompt optimization is a key area of investment, aiming to tie directly into evaluations to continuously improve agent performance. This is particularly important as prompting remains a critical skill, despite predictions of its obsolescence, with ongoing research like GDPA influencing future developments.

THE CONTINUED IMPORTANCE OF PROMPTING AND KODEX BENEFITS

Contrary to early predictions, prompting has become more entrenched and important in AI development. OpenAI recognizes this by investing in prompt optimization tools. Meanwhile, Codex continues to accelerate development, with users increasingly trusting it for larger coding tasks and feature development. Its capabilities extend to code reviews, improving developer productivity by providing starting points and aiding in code navigation, thereby reducing context-switching time and facilitating faster onboarding into codebases.

RELIABILITY AND DEVELOPER TRANSPARENCY WITH SERVICE HEALTH DASHBOARD

A significant, though not stage-highlighted, launch is the Service Health Dashboard. This tool provides developers with real-time, organization-scoped insights into their API integration performance, including token velocity, throughput, and response codes. This initiative stems from a deep focus on reliability, particularly after past outages, aiming to offer developers transparency and peace of mind as OpenAI works towards achieving higher levels of uptime, such as five nines.

THE EXPANDING ECOSYSTEM OF CONNECTORS AND WIDGETS

The Connector Registry and the ChatKit widgets are central to building a robust AI application ecosystem. While OpenAI offers first-party connectors, the MCP protocol encourages third-party development, creating a diverse range of tool integrations. Similarly, ChatKit's embeddable iframe and widget builder aim to become a drop-in solution for common chat experiences, offering a polished, consumer-grade UI that potentially reduces the need for developers to reinvent these fundamental components.

FUTURE DIRECTIONS: MODALITIES, EXTERNALIZATION, AND COST MANAGEMENT

Future developments for Agent Kit include expanded modalities (like voice), more logical nodes for diverse workflows, and enabling users to run agent workflows independently of ChatKit. The platform is also evolving to address developer concerns around inference costs, with 'bring your own key' being a potential future feature. This aligns with the vision of making ChatGPT a personal assistant by integrating deeply with user workflows and identity, as seen in partnerships with Apple and Kakao.

Mentioned in This Episode

●Software & Apps

●Companies

●Organizations

●Concepts

Common Questions

The key takeaway is OpenAI's continued commitment to empowering developers by opening up their technology. New tools like the Apps SDK and Agent Kit aim to make it easier for developers to build and deploy sophisticated AI applications, leveraging ChatGPT's distribution and advanced agent capabilities.

Topics

Agent Kit Agent Builder SDKs API Design OpenAI DevDay LLM Workflows

Mentioned in this video

Software & Apps

Thinky

Referred to as a 'cheeky one' by the interviewer, likely a colloquial or internal name for a tool/API.

Connector Registry

A part of the Agent Kit framework for managing connections to various tools and services.

RFT fine-tuning API

OpenAI's API for fine-tuning models, mentioned as being grouped outside the Agent Kit umbrella.

Agent Kit

A set of solutions launched by OpenAI to help developers build, deploy, and optimize AI agents.

Track Kit

A component of Agent Kit, described as an embeddable iframe for tracking agent activity.

ChatKit

A component of Agent Kit that handles the chat user experience and provides widgets.

Agent SDK

A software development kit for building agents that can interact with tools and APIs.

Plugins

An earlier feature allowing ChatGPT to interact with external tools and services.

GPTs

Customizable versions of ChatGPT that users can create for specific purposes.

Agent Builder

A component of Agent Kit that allows developers to visually model, iterate, and deploy agents.

Eval

A tool for evaluating the performance of AI models and agents, integrated into Agent Kit.

Media

Late in Space

The podcast being recorded at the OpenAI DevDay studio.

Found this useful? Build your knowledge library

Get AI-powered summaries of any YouTube video, podcast, or article in seconds. Save them to your personal pods and access them anytime.

Get Started Free