François Chollet: ARC-AGI-3, Beyond Deep Learning & A New Approach To ML
Key Moments
AI's current trajectory, focused on scaling LLMs, might be suboptimal. François Chollet proposes program synthesis as a more efficient path to AGI, building a new learning substrate for truly optimal AI.
Key Insights
François Chollet's lab, Ndea, is researching program synthesis as a new branch of machine learning, aiming for models closer to optimal than current deep learning approaches.
The ARC (Abstraction and Reasoning Corpus) benchmark has evolved: base LLMs scored below 10% on V1 until reasoning models broke through, V2 was saturated by agents using post-training RL loops, and V3 introduces interactive, agentic intelligence measured by exploration efficiency.
Current LLM-based coding agents achieve success due to verifiable reward signals (like unit tests) and the ability to embed execution models, not necessarily due to higher fluid intelligence.
Chollet predicts AGI by 2030, coinciding with ARC-AGI 6 or 7, and believes true AGI might be a codebase under 10,000 lines, operating on a knowledge base, reminiscent of foundational scientific principles.
Chollet advocates for exploring alternative AI approaches beyond current LLM scaling, suggesting that redirecting investment into areas like genetic algorithms or older research from the 70s/80s could yield significant breakthroughs.
For aspiring AI researchers or developers, focusing on usability, community building, and integrating AI into domain expertise is key, as AI progress is inevitable and best leveraged as an empowering tool.
The limitations of current deep learning and the search for optimality
François Chollet discusses the current AI landscape, dominated by scaling deep learning models and LLMs. He argues that while this path is yielding results and driving progress, it is not necessarily optimal. Deep learning primarily relies on fitting the parameters of a model to data using gradient descent. Chollet's new venture, Ndea, aims to build a fundamentally different branch of machine learning. Instead of parametric curves, it is developing symbolic models designed to be as concise and simple as possible while still explaining the data. This shift requires a new optimization method, dubbed 'symbolic descent,' to replace gradient descent. The goal is machine learning engines that yield extremely concise symbolic models, which should require far less training data, run far more efficiently at inference, and generalize and compose better, in line with the minimum description length principle.
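The minimum description length idea can be made concrete with a toy model-selection sketch. Everything below (the candidate expressions and the character-count complexity measure) is an illustrative assumption, not Ndea's actual method: among models that explain the data equally well, prefer the shorter one.

```python
import math

def description_length(expr, fn, data):
    """Crude MDL score: model size (characters) plus bits to encode residuals."""
    residual = sum((fn(x) - y) ** 2 for x, y in data)
    # log-encoding of residual error; the +1 avoids log(0) for a perfect fit
    return len(expr) + math.log2(1 + residual)

# Observations generated by f(x) = 2x + 1
data = [(x, 2 * x + 1) for x in range(10)]

# Two candidate symbolic models; both fit the data exactly,
# but one is a longer description of the same function.
candidates = {
    "2*x+1": lambda x: 2 * x + 1,
    "x**2-x*(x-2)+1": lambda x: x**2 - x * (x - 2) + 1,
}

best = min(candidates, key=lambda e: description_length(e, candidates[e], data))
print(best)  # the shorter exact model wins: 2*x+1
```

With zero residual on both candidates, the score reduces to description length alone, so the concise model is selected, which is the intuition behind preferring short symbolic models for generalization.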
Program synthesis as an alternative foundation
Ndea's core research lies in program synthesis, which Chollet clarifies is not about code generation or coding agents. Instead, it is about rebuilding the entire machine learning stack on different foundations. Rather than adding layers on top of the existing LLM stack, Ndea is creating a new 'learning substrate' distinct from parametric deep learning. This involves finding the shortest symbolic model that explains the data, a search that cannot use gradient descent; 'symbolic descent' is their proposed solution, the symbolic equivalent of gradient descent. The promise is that this approach, while radically different and with a lower perceived chance of near-term success (around 10-15%), could lead to AI that is much closer to optimality, requiring less data and generalizing better, unlike the current 'more compute, more data' scaling paradigm.
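In its simplest form, program synthesis can be sketched as shortest-first enumeration over a small domain-specific language: try all programs of length 1, then length 2, and so on, returning the first one consistent with the examples. The primitives below are invented for illustration; Ndea's actual substrate is not public.

```python
from itertools import product

# A tiny DSL of composable integer functions (illustrative, not Ndea's).
PRIMS = {
    "inc": lambda x: x + 1,
    "dbl": lambda x: x * 2,
    "neg": lambda x: -x,
}

def synthesize(examples, max_len=3):
    """Return the shortest primitive sequence consistent with all examples."""
    for length in range(1, max_len + 1):
        for prog in product(PRIMS, repeat=length):
            def run(x, prog=prog):
                for name in prog:
                    x = PRIMS[name](x)
                return x
            if all(run(x) == y for x, y in examples):
                return prog  # shortest-first enumeration => minimal program
    return None  # no program of length <= max_len explains the data

# Input-output pairs generated by f(x) = 2x + 1
print(synthesize([(1, 3), (2, 5), (5, 11)]))  # ('dbl', 'inc')
```

Because enumeration proceeds shortest-first, the result is automatically the most concise program in the DSL that explains the examples, tying program synthesis directly to the minimum-description-length preference discussed above. Real systems replace this brute-force loop with guided search, which is where the 'symbolic descent' idea comes in.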
The ARC benchmark: a barometer for evolving AI capabilities
Chollet details the evolution of the ARC (Abstraction and Reasoning Corpus) benchmark, designed to measure fundamental intelligence rather than performance on scaled-up data. ARC V1 initially showed very low scores for LLMs (sub-10%), even as models scaled massively, indicating that scale alone was not sufficient for fluid intelligence. The breakthrough came with reasoning models (like OpenAI's o1 and o3), which demonstrated a step-function improvement on V1, signaling the emergence of new capabilities. ARC V2 was then saturated by agentic approaches, particularly those employing reinforcement learning loops and post-training verification mechanisms (similar to coding agents). This showed that models can become more useful through refined training paradigms and verifiable reward signals, without necessarily becoming 'smarter' in a fluid intelligence sense.
Introducing ARC-AGI V3: measuring agentic intelligence
ARC-AGI V3 represents a significant shift, moving beyond static pattern modeling to measure 'agentic intelligence.' In V3, AI agents are placed in interactive, mini-video game-like environments without any initial instructions. They must explore, set their own goals, build a model of the environment through trial and error, and then execute plans to achieve those goals. The evaluation focuses on action efficiency, aiming for AI agents to perform with the same efficiency as humans, who can typically master these novel environments within hundreds to thousands of actions. V3 is designed to be more resistant to the 'harness' strategies used to saturate V2, featuring a private set of significantly different games that are not directly representative of performance on the public set, thus better testing fluid intelligence.
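The action-efficiency evaluation described above can be sketched as a simple normalization against a human baseline. The exact ARC-AGI-3 scoring formula is not given in this summary; the capped ratio below is an assumption for illustration only.

```python
def action_efficiency(agent_actions: int, human_actions: int) -> float:
    """Illustrative score: 1.0 = human-level efficiency in a novel environment;
    lower values mean the agent wasted more actions exploring.
    (Hypothetical metric, not the official ARC-AGI-3 formula.)"""
    if agent_actions <= 0:
        raise ValueError("agent must take at least one action")
    # Cap at 1.0 so beating the human baseline doesn't inflate the score
    return min(1.0, human_actions / agent_actions)

# A human masters the game in ~800 actions; the agent needs 4000
print(action_efficiency(agent_actions=4000, human_actions=800))  # 0.2
```

Measuring efficiency rather than raw success is what makes the benchmark resistant to brute-force harnesses: an agent that eventually wins after millions of actions still scores poorly.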
The future of AGI and the quest for fundamental principles
Chollet predicts AGI could arrive as early as 2030, potentially coinciding with ARC-AGI 6 or 7. He posits that true AGI might not require billions of parameters but could be a relatively small codebase (under 10,000 lines) operating on a large knowledge base, akin to embodying the scientific method. This vision contrasts with the current trend of massive model scaling. He believes intelligence acquisition is key, and while human intelligence is complex and messy, it offers inspiration. Ndea aims to identify fundamental principles of intelligence and build a system that optimally implements them, rather than replicating biological processes. This approach prioritizes recursive self-improvement and efficiency over sheer scale, aiming to remove humans from the continuous improvement loop.
Investing in alternative paths and foundational research
Chollet encourages the AI community to explore approaches beyond the dominant LLM paradigm. He suggests that immense resources poured into current methods could yield comparable breakthroughs if invested in other areas like genetic algorithms or older, less explored research from the 70s and 80s. He views the current unification around gradient descent and LLMs as potentially limiting. For aspiring researchers, he recommends delving into foundational, often overlooked, research and looking for approaches that inherently scale without requiring constant human engineering intervention, focusing on systems that can improve themselves without bottlenecks. The goal is to build intelligence from first principles, not just extend existing models.
Leveraging AI progress: empowerment through expertise
Addressing concerns about job displacement and AI taking over, Chollet offers an optimistic perspective. He argues that increased AI capabilities do not necessarily make humans obsolete; instead, they can be empowering. The more expertise an individual has in a domain, the better they can leverage AI tools. He advises people to learn as much as possible, not only about AI itself but also about the specific domains they wish to apply it to. The key is to treat AI progress as an opportunity and a tool for personal and professional advancement, rather than as an unstoppable force to be passively endured. Riding the wave of AI progress by integrating it with domain expertise is the path he recommends.
Common Questions
What is Ndea?
Ndea is a new AGI research lab focused on developing a branch of machine learning that moves beyond deep learning toward greater optimality. It uses program synthesis and a new optimization method called symbolic descent to create concise, efficient, and generalizable symbolic models from data.
Mentioned in this video
An open-source project that has gained significant traction, reaching 40,000 stars.
An open-source deep learning library developed by François Chollet, noted for its simple API, usability, and community building.
An influential machine learning library for Python, which inspired Keras with its ease of use and accessibility.
The current dominant architecture in AI, which the speaker believes is a temporary stage and not the path to true optimality.
AI systems that have shown surprising success recently, primarily due to their ability to operate in domains with verifiable reward signals like code.
Mentioned as an example of deep learning guiding search, similar to the principles being applied at Ndea for program synthesis.
A research area focused on building new branches of machine learning by creating symbolic models instead of parametric ones, aiming for greater optimality and efficiency.
An alternative AI architecture that builds on the current stack, representing a slightly different approach to AI modeling.
The current dominant approach in machine learning, which the speaker contrasts with their new paradigm, arguing it's nearing its limits for achieving true optimality.
An alternative to gradient descent, designed for finding the simplest possible symbolic models of data, aiming for greater conciseness and generalization.
A principle suggesting that the shortest model of the data is the most likely to generalize, which underpins the Ndea approach.
A domain that is well-suited for current AI technology due to its inherent verifiable reward signals, as are other formally verifiable domains.
An alternative AI approach that the speaker believes has significant potential and could be scaled up to achieve exciting results, potentially even enabling new scientific discoveries.
Mentioned as an example of a game where OpenAI's models (OpenAI Five) were trained for extensive periods, highlighting differences with ARC's approach.
DeepMind's early work in 2013 on solving these games using deep reinforcement learning is cited as a pioneering effort in using AI for game-playing.