Key Moments

The AI Frontier: from FLOPs to Megawatts — Anjney Midha, AMP

Latent Space PodcastLatent Space Podcast
Science & Technology5 min read61 min video
Jun 18, 2026|196 views|14
Save to Pod

Want to know something specific about what's covered?

We've already dissected every moment. Ask and we will deliver (with timestamps).

TL;DR

AI compute utilization is surprisingly low, often below 70% MFU, indicating massive waste that common sense infrastructure practices could fix, and community backlash against data centers is a significant bottleneck.

Key Insights

1

96% node utilization should be standard, but most single-down clusters are not running at that, and best-in-class MFU (Model FLOPs Utilization) is only 60-70%.

2

Up to 20% of data centers in the US might face community backlash this year, risking project approval due to concerns about power grids and the environment.

3

Amp aims to be an independent system operator for compute, analogous to the electric grid, pooling supply and demand to make 'megaflops flow like megawatts'.

4

The 'bitter lesson' for AI scaling doesn't excuse abandoning common sense in infrastructure; AI scaling should increase the premium on robust infrastructure due to higher costs of wastage.

5

Anthropic's success is attributed to years of 'preparedness' and efficiency, focusing on a P0 mission (like coding) and having a strong 'culture of safety' that acts as a moat.

6

Venture capitalists often fail to recognize dynamic agents in AI, boxing researchers into narrow roles and overlooking that high-level scientific achievement often translates to strong CEO potential.

Wasted compute: the hidden cost of AI scaling

Anjney Midha highlights a critical inefficiency in AI compute utilization. While 96% node utilization is considered standard (an outage at Google if below 95%), most AI clusters operate far below this. Even more concerning is Model FLOPs Utilization (MFU), where the best-in-class performance is only between 60-70%. This indicates massive underutilization of expensive GPU resources. Midha attributes this not to a lack of funding or compute, but to a 'culture' problem and a lack of alignment between those funding compute and those deploying it. He argues that the rapid scaling demands in AI have led to compounded wastage, a phenomenon that common sense and iterative bring-ups, principles long understood in the semiconductor industry, could significantly mitigate.

Community backlash and infrastructure reliability

The expansion of data centers, crucial for AI development, faces significant community resistance. Midha notes that up to 20% of data centers in the US may be at risk this year due to community backlash, stemming from concerns over power grids, environmental impact, and permitting. This highlights a critical bottleneck that goes beyond technical capabilities. He proposes an innovative solution: data center operators could charge a marginal premium on compute (e.g., an extra $0.50 per hour) and direct these funds to local communities. This would create a clear public benefit, fostering community support and ensuring more reliable infrastructure, effectively turning potential opposition into partnership.

Amp's vision for a compute grid

Amp aims to revolutionize AI infrastructure by creating a 'compute grid' that functions like the electric grid, making 'megaflops flow like megawatts.' This involves a horizontal, multi-cloud, and multi-silicon approach focused on pooling and utilization, rather than vertical integration. Acting as an independent system operator (ISO), Amp will coordinate supply from various partners and demand from research labs and AI companies. This model, inspired by historical grid operators like PJM, focuses on neutrality and aggregation of uncorrelated demand to maximize utilization and create a fungible compute market, addressing the current fragmentation and stranded pools of compute.

The limits of 'move fast and break things' in AI

While the hustle and hacker mindset is valuable for startups, Midha argues that AI infrastructure requires a shift towards 'responsible infrastructure.' The uncontrolled pursuit of speed without stable foundations can lead to systemic failures. He draws a parallel to Mark Zuckerberg's evolution from 'move fast, break things' to emphasizing stable infrastructure. In AI, the margin for error and the cost of wastage are significantly higher, making common sense and robust infrastructure non-negotiable. Abandoning these principles in the name of AI progress is a mistake that will inevitably lead to an accounting for unforeseen consequences.

Venture capital's view on talent and culture

Midha criticizes the venture capital community's tendency to pigeonhole individuals, particularly scientists and researchers, into predefined roles. He argues that top-tier scientists, who have already demonstrated immense performance and discipline, often possess the qualities needed to be great CEOs. The example of Anastasia, co-creator of ChatGPT and founder of LM Arena, illustrates this point, showcasing her ability to excel not only in research but also in building impactful projects. Midha advocates for funding these 'star athletes of the mind,' recognizing that the drive for scientific rigor can translate into strong leadership and a mission-driven approach, rather than forcing them into conventional CEO molds.

Anthropic's preparedness and the culture of safety

Anthropic's success, particularly in coding capabilities, is attributed to a deliberate, long-term strategy of 'preparedness,' not just luck. For four years, the company has been highly efficient, focusing on a P0 mission (coding) and cultivating a 'culture of safety.' This involves a constant awareness of the risks associated with powerful AI systems and a commitment to responsible development, even if it means delaying product launches. This focus on safety, despite potential accusations of over-caution, has served as a durable moat, reinforcing their mission alignment and making them uniquely positioned to handle mission-critical AI applications.

The mission-driven approach at Periodic Labs

Periodic Labs exemplifies a mission-driven approach, centered on scientific breakthroughs, particularly in areas like superconductivity and end-of-life prediction in healthcare. The company's challenges, like attracting talent and navigating technical hurdles, serve as the 'hardship' that hones its culture. Midha emphasizes that true mission alignment, especially when resources are scarce, forces teams to define their P0 (priority zero) and make difficult trade-offs. This contrasts with labs that raise excessive capital too quickly, diluting focus and potentially leading to a fragile culture that fails to reach its full potential.

Output maximization and the nature of progress

The core philosophy driving Midha's work, and exemplified by 'output maxing,' is achieving optimal outcomes through AI. This involves pushing the frontier of capabilities while minimizing waste – whether it's computational resources, human potential, or healthcare expenditures. He sees AI as a tool to move beyond simplistic analogies and reasoning, encouraging a return to first principles. Progress, he suggests, comes from either standardizing protocols for lossless communication or developing entirely new capabilities that unlock abundance, making standardization less critical. This pursuit of maximal, efficient output is central to his vision for the future of AI infrastructure and research.

Common Questions

Many AI labs struggle to ship products despite having sufficient cash and compute. The speaker diagnoses this as a cultural issue, where a lack of consistent action demonstrating mission alignment causes culture to fray.

Topics

Mentioned in this video

More from Latent Space

View all 227 summaries

Ask anything from this episode.

Save it, chat with it, and connect it to Claude or ChatGPT. Get cited answers from the actual content — and build your own knowledge base of every podcast and video you care about.

Get Started Free