Key Moments

AI Dev 26 x SF | Paul Everitt: The Shift to Agentic Engineering

DeepLearning.AIDeepLearning.AI
Education5 min read29 min video
May 22, 2026|37 views
Save to Pod

Want to know something specific about what's covered?

We've already dissected every moment. Ask and we will deliver (with timestamps).

TL;DR

Agentic engineering promises AI-driven productivity boosts, but rising costs and trust issues threaten to make it a "challenger disaster" if not managed with a renewed focus on core engineering principles.

Key Insights

1

A 2023 study found 95% of companies saw little to no durable organizational value from AI investments, suggesting code generation speed-up yields only about 10% improvement.

2

95% of Europeans expressed concern about AI's impact, highlighting potential governance and sovereignty issues distinct from other regions.

3

Only 3% of developers expressed high confidence in AI-generated code accuracy in 2023, indicating a significant trust gap.

4

Layoffs have been linked to stock price increases, but phrases like "AI washing" and the emergence of roles like "AI HR managers" suggest a complex job market shift rather than outright elimination.

5

Agentic engineering aims to build "the thing that builds the thing," shifting the focus from direct coding to systems design, scaffolding, and augmenting human capabilities.

6

The concept of 'red-green testing' for agents, where a broken test is written first, helps define success and guides agents to mimic desired testing styles and outcomes.

The productivity paradox and current industry challenges

Despite a surge of AI advancements, the software engineering industry faces significant challenges in translating individual AI tool productivity gains into durable organizational value. Early insights from a mid-2023 study indicated that 95% of companies failed to extract substantial benefits from their AI investments. While AI can accelerate coding, it contributes to only a fraction of the overall software engineering process. Studies suggest that speed-ups in coding yield improvements closer to 10%, not the often-touted 10x. Code itself was rarely the primary bottleneck; rather, the broader spectrum of software engineering activities presents far greater complexity. This disconnect between perceived AI potential and tangible organizational wins points to a need for a more strategic approach beyond simply generating more code. The industry is grappling with issues beyond raw output, including quality concerns, rising costs, and a fundamental lack of trust in AI-generated results. This creates a "missed opportunity" to leverage AI for genuine innovation rather than just profit margin squeezing.

Quality, cost, and trust emerge as critical roadblocks

Concerns about quality are paramount, with a noted "50% bad number" for phase defect rates and a fear of "challenger disaster" if humans are removed from the loop, potentially leading to catastrophic errors like accidental table drops. The cost of AI tools, specifically token consumption, is becoming unsustainable, with pricing already beginning to shift upwards. Furthermore, trust remains a significant hurdle; a 2023 survey revealed that only 3% of developers had high confidence in the accuracy of AI-generated code. This lack of trust, coupled with the tendency for employees to "game the system" when provided with new tools, contributes to organizational friction. The "dark factory" pattern, where human oversight is minimized, raises alarms, especially in regulated industries and for European workers who, at 84%, express caution. This environment fuels a "fear of being obsolete" among the workforce.

Agentic engineering: a new framework for the AI era

In response to these challenges, the concept of "agentic engineering" is emerging as a proposed solution. Coined and popularized by figures like Andrej Karpathy and Simon Willison, it represents a shift from "vibe coding"—haphazardly relying on AI—to a more disciplined engineering practice. The core idea is to build "the thing that builds the thing," reminiscent of early software engineering principles. This approach reframes the developer's role from direct code creation to systems design, scaffolding, and maximizing human leverage. OpenAI's "harness engineering" model exemplifies this, focusing on building the infrastructure that enables AI to build solutions. The emphasis is on augmenting, not replacing, humans, recognizing that the human element remains critical, albeit potentially a bottleneck to be managed strategically.

Key practices within agentic engineering

Agentic engineering in practice involves several key disciplines. "Spec-driven development" ensures human input steers AI work and aligns stakeholders. Rigorous "evaluations" are crucial to determine if generated code is good, efficient, and within budget, moving beyond simple "button go clicky" functionality. This necessitates data scientists to analyze model performance, as different LLMs have varied characteristics. "Harness engineering," focusing on owning the AI's memory and tools, is vital. "Tooling" enables agents to execute code securely in sandboxes, a capability being developed by companies like Cloudflare. The "red-green testing" pattern is adapted for agents, where writing a broken test first sets clear expectations and allows the agent to learn and mimic the desired testing style, effectively defining success for the AI.

Rethinking codebases and building for AI collaboration

Working with large, legacy codebases necessitates a re-evaluation of modularity. Principles like "Ojiic engineering" might guide the reorganization of code to support parallel and specialized sub-agents. The development of "QA agents" aims to augment human testers by prepping work or identifying instrumentation needs, allowing them to collect data as they deem fit. This integration requires developers to write code that acts as glue, connecting AI capabilities with the outside world. "Observability," encompassing both general system and AI-specific metrics, becomes critical to monitor these complex interactions. Furthermore, "orchestration" principles are essential for designing coherent, architecturally sound systems, drawing parallels to system design fundamentals that ensure resilience and maintainability.

Leadership, culture, and the call to arms

For leaders, managing the shift to agentic engineering requires addressing the "fear of being obsolete" within teams. The message to stakeholders should pivot from simply automating current processes and replacing workers to fostering innovation and augmenting human capabilities. This shift requires a return to the discipline of engineering, emphasizing its scientific and systematic nature, harkening back to foundational principles. Grady Booch advocates for the development of "agentic patterns," urging the community to define this new discipline. The ultimate goal is to move beyond the hype cycle of "vibe coding" and establish agentic engineering as a robust, principled practice, enabling the creation of novel solutions and finding purpose and joy in the work.

Agentic Engineering: Dos and Don'ts

Practical takeaways from this episode

Do This

Build the thing that builds the thing.
Augment the human, don't eliminate them.
Prioritize systems design and discipline.
Focus on steering work and aligning stakeholders (spec-driven development).
Get good at evaluations (evals) for agent performance.
Consider harness engineering and owning agent memory.
Leverage agents for tooling and secure code execution.
Write broken tests first for agents to learn your style.
Reorganize code for modularity with parallel and specialized sub-agents.
Develop QA agents to prep work and move capabilities to agents.
Integrate agents with the outside world (e.g., DevTools protocol).
Implement both system and AI observability.
Manage up to stakeholders and down to your team, addressing F.O.B.B.O.
Shift focus from automating the current to innovation and augmenting humans.
Embrace agentic patterns as the next evolution in software engineering.

Avoid This

Don't assume coding is the primary bottleneck in software engineering.
Don't purely focus on profit margin with AI; use it for innovation.
Don't just fix AI-generated problems; fix the root cause.
Don't let the LLM be the sole judge of code quality.
Don't rely on 'vibe coding'; embrace disciplined engineering.
Don't eliminate the human bottleneck; augment it.
Don't over-rely on generic agent responses; use specs for steering.
Don't accept 'button-clicky' results as true engineering.
Don't wait for problems; proactively build better systems.
Don't be afraid of 'AI washing'; focus on genuine engineering value.
Don't just automate and replace; innovate and augment.

AI Impact on Development and Team Sentiment

Data extracted from this episode

MetricFindingSource/Context
Organizational Value from AILimited compared to individual value.Speaker observation
AI Productivity Improvement10% (not 10x)DX study follow-on
Defect Rate in AI-Generated Code50% bad rate perceived.Talk on defects
Confidence in AI Accuracy (Last Year)3%Developer survey
Employee Rebellion/Rebellion SentimentEngineers outright rebelling.Fortune Magazine article
Management vs. Engineer Sentiment Gap (Mental Health)67 points.Fortune Magazine article
European Trust in AI84% skeptical ('Eh').European sentiment data
Work Enabled by AI (Anthropic)27% of work using 'god box' capabilities.Anthropic example
Impact of 'Mega Layoffs' on Stock PriceStock price goes up.Observation on industry trend
Percentage Involved in Agent-assisted CodingMost hands raised.Audience poll
Percentage Shipping Agent Code to ProductionAbout two-thirds.Audience poll
Proportion of Large Codebases20% (estimated).Audience poll
Number of AI Orchestration StartupsApprox. 500 in Series X.Speaker estimate

Common Questions

Agentic engineering is a paradigm shift in software development where instead of just building software, engineers build the systems or 'agents' that build the software. It focuses on creating more leverage and augmenting human capabilities rather than just automating current tasks.

Topics

Mentioned in this video

More from DeepLearningAI

View all 94 summaries

Ask anything from this episode.

Save it, chat with it, and connect it to Claude or ChatGPT. Get cited answers from the actual content — and build your own knowledge base of every podcast and video you care about.

Get Started Free