Why is traditional AI integration in software engineering falling short?

Traditional AI integration often fails to deliver organizational value because code generation was never the main bottleneck; the complexity lies in the broader engineering process. Issues like quality, cost (token subsidies), trust, and potential for 'gaming the system' hinder effective adoption.

What are the key problems with current AI in software development?

Key problems include a lack of durable organizational value despite individual productivity gains, concerns about quality and defects, the high cost of AI tokens, low developer trust in AI accuracy (around 3%), potential for misuse ('token maxing'), and negative impacts on engineer morale and mental health.

How does agentic engineering redefine the role of a software engineer?

Agentic engineering shifts the engineer's role from solely building the final product to building the AI systems that build the product. This involves focusing on systems design, scaffolding, and ensuring humans remain central as augmented collaborators, not bottlenecks.

What are the practical aspects of implementing agentic engineering?

Practical implementation involves focusing on robust evaluations (evals), harnessing agent capabilities and memory, integrating tool usage with secure sandboxing, adopting techniques like red-green testing for agents, and rethinking code organization for modularity and parallel processing.

What is 'vibe coding' and why is agentic engineering preferred?

'Vibe coding' is described as engineers just hoping their generated code works. Agentic engineering is preferred because it brings back the discipline and rigor of engineering principles, focusing on building reliable systems and using structured approaches like spec-driven development.

How can engineering leadership foster agentic engineering?

Leadership needs to shift the narrative from pure profit and automation to innovation and human augmentation. This involves managing stakeholder expectations, addressing fears of obsolescence (F.O.B.B.O.), fostering a culture of discipline, and guiding teams toward building systems that build systems.

What is the future of agentic engineering?

The future involves developing 'agentic patterns'—the next evolution after design patterns—and establishing agentic engineering as a formal discipline. The goal is to move beyond simply automating current workflows towards enabling bold, new innovations by augmenting human potential.

Key Moments

AI Dev 26 x SF | Paul Everitt: The Shift to Agentic Engineering

DeepLearning.AI

Education5 min read29 min video

May 22, 2026|37 views

Save to Pod

Want to know something specific about what's covered?

We've already dissected every moment. Ask and we will deliver (with timestamps).

Key Moments

TL;DR

Agentic engineering promises AI-driven productivity boosts, but rising costs and trust issues threaten to make it a "challenger disaster" if not managed with a renewed focus on core engineering principles.

Key Insights

A 2023 study found 95% of companies saw little to no durable organizational value from AI investments, suggesting code generation speed-up yields only about 10% improvement.

95% of Europeans expressed concern about AI's impact, highlighting potential governance and sovereignty issues distinct from other regions.

Only 3% of developers expressed high confidence in AI-generated code accuracy in 2023, indicating a significant trust gap.

Layoffs have been linked to stock price increases, but phrases like "AI washing" and the emergence of roles like "AI HR managers" suggest a complex job market shift rather than outright elimination.

Agentic engineering aims to build "the thing that builds the thing," shifting the focus from direct coding to systems design, scaffolding, and augmenting human capabilities.

The concept of 'red-green testing' for agents, where a broken test is written first, helps define success and guides agents to mimic desired testing styles and outcomes.

The productivity paradox and current industry challenges

Despite a surge of AI advancements, the software engineering industry faces significant challenges in translating individual AI tool productivity gains into durable organizational value. Early insights from a mid-2023 study indicated that 95% of companies failed to extract substantial benefits from their AI investments. While AI can accelerate coding, it contributes to only a fraction of the overall software engineering process. Studies suggest that speed-ups in coding yield improvements closer to 10%, not the often-touted 10x. Code itself was rarely the primary bottleneck; rather, the broader spectrum of software engineering activities presents far greater complexity. This disconnect between perceived AI potential and tangible organizational wins points to a need for a more strategic approach beyond simply generating more code. The industry is grappling with issues beyond raw output, including quality concerns, rising costs, and a fundamental lack of trust in AI-generated results. This creates a "missed opportunity" to leverage AI for genuine innovation rather than just profit margin squeezing.

Quality, cost, and trust emerge as critical roadblocks

Concerns about quality are paramount, with a noted "50% bad number" for phase defect rates and a fear of "challenger disaster" if humans are removed from the loop, potentially leading to catastrophic errors like accidental table drops. The cost of AI tools, specifically token consumption, is becoming unsustainable, with pricing already beginning to shift upwards. Furthermore, trust remains a significant hurdle; a 2023 survey revealed that only 3% of developers had high confidence in the accuracy of AI-generated code. This lack of trust, coupled with the tendency for employees to "game the system" when provided with new tools, contributes to organizational friction. The "dark factory" pattern, where human oversight is minimized, raises alarms, especially in regulated industries and for European workers who, at 84%, express caution. This environment fuels a "fear of being obsolete" among the workforce.

Agentic engineering: a new framework for the AI era

In response to these challenges, the concept of "agentic engineering" is emerging as a proposed solution. Coined and popularized by figures like Andrej Karpathy and Simon Willison, it represents a shift from "vibe coding"—haphazardly relying on AI—to a more disciplined engineering practice. The core idea is to build "the thing that builds the thing," reminiscent of early software engineering principles. This approach reframes the developer's role from direct code creation to systems design, scaffolding, and maximizing human leverage. OpenAI's "harness engineering" model exemplifies this, focusing on building the infrastructure that enables AI to build solutions. The emphasis is on augmenting, not replacing, humans, recognizing that the human element remains critical, albeit potentially a bottleneck to be managed strategically.

Key practices within agentic engineering

Agentic engineering in practice involves several key disciplines. "Spec-driven development" ensures human input steers AI work and aligns stakeholders. Rigorous "evaluations" are crucial to determine if generated code is good, efficient, and within budget, moving beyond simple "button go clicky" functionality. This necessitates data scientists to analyze model performance, as different LLMs have varied characteristics. "Harness engineering," focusing on owning the AI's memory and tools, is vital. "Tooling" enables agents to execute code securely in sandboxes, a capability being developed by companies like Cloudflare. The "red-green testing" pattern is adapted for agents, where writing a broken test first sets clear expectations and allows the agent to learn and mimic the desired testing style, effectively defining success for the AI.

Rethinking codebases and building for AI collaboration

Working with large, legacy codebases necessitates a re-evaluation of modularity. Principles like "Ojiic engineering" might guide the reorganization of code to support parallel and specialized sub-agents. The development of "QA agents" aims to augment human testers by prepping work or identifying instrumentation needs, allowing them to collect data as they deem fit. This integration requires developers to write code that acts as glue, connecting AI capabilities with the outside world. "Observability," encompassing both general system and AI-specific metrics, becomes critical to monitor these complex interactions. Furthermore, "orchestration" principles are essential for designing coherent, architecturally sound systems, drawing parallels to system design fundamentals that ensure resilience and maintainability.

Leadership, culture, and the call to arms

For leaders, managing the shift to agentic engineering requires addressing the "fear of being obsolete" within teams. The message to stakeholders should pivot from simply automating current processes and replacing workers to fostering innovation and augmenting human capabilities. This shift requires a return to the discipline of engineering, emphasizing its scientific and systematic nature, harkening back to foundational principles. Grady Booch advocates for the development of "agentic patterns," urging the community to define this new discipline. The ultimate goal is to move beyond the hype cycle of "vibe coding" and establish agentic engineering as a robust, principled practice, enabling the creation of novel solutions and finding purpose and joy in the work.

Mentioned in This Episode

●Software & Apps

●Companies

●Organizations

●Concepts

●People Referenced

Agentic Engineering: Dos and Don'ts

Practical takeaways from this episode

Do This

Build the thing that builds the thing.

Augment the human, don't eliminate them.

Prioritize systems design and discipline.

Focus on steering work and aligning stakeholders (spec-driven development).

Get good at evaluations (evals) for agent performance.

Consider harness engineering and owning agent memory.

Leverage agents for tooling and secure code execution.

Write broken tests first for agents to learn your style.

Reorganize code for modularity with parallel and specialized sub-agents.

Develop QA agents to prep work and move capabilities to agents.

Integrate agents with the outside world (e.g., DevTools protocol).

Implement both system and AI observability.

Manage up to stakeholders and down to your team, addressing F.O.B.B.O.

Shift focus from automating the current to innovation and augmenting humans.

Embrace agentic patterns as the next evolution in software engineering.

Avoid This

Don't assume coding is the primary bottleneck in software engineering.

Don't purely focus on profit margin with AI; use it for innovation.

Don't just fix AI-generated problems; fix the root cause.

Don't let the LLM be the sole judge of code quality.

Don't rely on 'vibe coding'; embrace disciplined engineering.

Don't eliminate the human bottleneck; augment it.

Don't over-rely on generic agent responses; use specs for steering.

Don't accept 'button-clicky' results as true engineering.

Don't wait for problems; proactively build better systems.

Don't be afraid of 'AI washing'; focus on genuine engineering value.

Don't just automate and replace; innovate and augment.

AI Impact on Development and Team Sentiment

Data extracted from this episode

Metric	Finding	Source/Context
Organizational Value from AI	Limited compared to individual value.	Speaker observation
AI Productivity Improvement	10% (not 10x)	DX study follow-on
Defect Rate in AI-Generated Code	50% bad rate perceived.	Talk on defects
Confidence in AI Accuracy (Last Year)	3%	Developer survey
Employee Rebellion/Rebellion Sentiment	Engineers outright rebelling.	Fortune Magazine article
Management vs. Engineer Sentiment Gap (Mental Health)	67 points.	Fortune Magazine article
European Trust in AI	84% skeptical ('Eh').	European sentiment data
Work Enabled by AI (Anthropic)	27% of work using 'god box' capabilities.	Anthropic example
Impact of 'Mega Layoffs' on Stock Price	Stock price goes up.	Observation on industry trend
Percentage Involved in Agent-assisted Coding	Most hands raised.	Audience poll
Percentage Shipping Agent Code to Production	About two-thirds.	Audience poll
Proportion of Large Codebases	20% (estimated).	Audience poll
Number of AI Orchestration Startups	Approx. 500 in Series X.	Speaker estimate

Common Questions

Agentic engineering is a paradigm shift in software development where instead of just building software, engineers build the systems or 'agents' that build the software. It focuses on creating more leverage and augmenting human capabilities rather than just automating current tasks.

Topics

Agentic Engineering Ai Tools AI & Machine Learning Technology & Innovation Programming & Software Future Of Work Software Development Developer Productivity AI In Coding Career & Skills Systems Design

Mentioned in this video

People

Ed Zitron

A newsletter writer who has been warning about the inability to keep subsidizing AI tokens and the rising costs.

Sam Altman

Introduced the concept of 'AI washing' and is seen as a counter-example to the idea that jobs are solely being eliminated, suggesting job creation might be occurring.

Simon Willison

Referred to as the 'developer AI developer whisperer' and co-founder of Django, who is concerned about quality issues with AI in production and is writing a book on agentic engineering.

Dave Farley

Associated with modern software engineering and definitions that emphasize engineering as more than just coding.

Daron Acemoglu

A Nobel Prize winner mentioned for his work on the productivity problem in economics and software engineering, highlighting the missed opportunities with AI.

Grady Booch

Creator of UML, who discussed the third golden age of software engineering and emphasized that coding was never the whole picture, also pushing back on agentic engineering being just software engineering.

Addi Osmani

Author of a blog post on Agentic Engineering, who also discussed 'vibe coding'.

Companies

Lasten

Mentioned among companies that conducted 'mega layoffs' in response to the evolving industry landscape, with stock prices reportedly increasing afterward.

Ask anything from this episode.

Save it, chat with it, and connect it to Claude or ChatGPT. Get cited answers from the actual content — and build your own knowledge base of every podcast and video you care about.

Get Started Free