Key Moments

Douglas Lenat: Cyc and the Quest to Solve Common Sense Reasoning in AI | Lex Fridman Podcast #221

Lex FridmanLex Fridman
Science & Technology5 min read173 min video
Sep 15, 2021|136,861 views|2,383|275
Save to Pod
TL;DR

Douglas Lenat discusses Cyc, a 37-year AI project for common sense reasoning, and the future of AI.

Key Insights

1

Common sense knowledge is crucial for AI, going beyond pattern recognition to true understanding.

2

Cyc aims to capture vast amounts of common sense knowledge, estimated at tens of millions of assertions.

3

AI needs both quick pattern matching (right brain) and deep, slow reasoning (left brain) for general intelligence.

4

Context and local consistency are vital for managing vast knowledge bases, moving away from strict global consistency.

5

AI can augment human intelligence by improving critical thinking and problem-solving capabilities.

6

The development of AI may lead to profound societal shifts, potentially granting rights to advanced AI systems.

THE CHALLENGE OF COMMON SENSE IN AI

Douglas Lenat introduces Cyc, a project initiated in 1984 to address the fundamental challenge of imbuing AI with common sense and general world knowledge. Existing AI systems, despite early successes, often fail due to a lack of this foundational understanding, analogous to a dog performing tricks without comprehending their meaning. True understanding, Lenat likens to the ground beneath our feet – a deep, layered foundation that allows for robust decision-making, especially in novel or unexpected situations, like navigating unexpected road hazards.

REPRESENTING AND UTILIZING KNOWLEDGE

The core of Cyc's mission involves representing common sense knowledge in a format that computers can use for reasoning. This involves translating human understanding, often implicit, into formal logical structures. Initial estimates suggested around a million pieces of common sense knowledge were needed, but experience at Cyc revealed the necessity for tens of millions of assertions. This knowledge isn't just about facts but includes rules of thumb, contextual understanding, and the ability to infer unstated information, crucial for understanding nuance in language and situations.

THE EVOLUTION OF CYC AND ITS SCALE

The Cyc project began with an ambitious goal, estimated in person-centuries of effort, significantly bolstered by initiatives like Japan's Fifth Generation Computer Project and subsequent US government funding. Over decades, Cyc has accumulated tens of millions of rules and assertions. While initially focused on broad common sense, recent efforts have shifted towards applying this knowledge to domain-specific applications in healthcare and industry, creating expert systems that are no longer brittle but grounded in a vast, underlying layer of common sense.

HANDLING EXCEPTIONS AND CONTEXTS

A significant challenge overcome by Cyc was the need to abandon strict global consistency. Recognizing that the real world is rife with exceptions and context-dependent truths, Cyc adopts a system of local consistency, akin to tectonic plates. Each context, or 'plate,' strives for internal consistency, but inconsistencies are allowed at boundaries. This allows Cyc to represent knowledge that is true in one situation (e.g., normal physics) but false in another (e.g., a cartoon world), and to handle nuances of belief, time, and abstraction.

SYNERGY BETWEEN SYMBOLIC AI AND MACHINE LEARNING

Lenat emphasizes the complementary strengths of symbolic AI (like Cyc, representing deep reasoning) and machine learning (representing quick pattern recognition). He uses the analogy of left and right brain hemispheres. While machine learning excels at identifying correlations, Cyc provides causal explanations and testable predictions, as demonstrated in a project with genomic data for disease correlation. This synergy, where machine learning generates hypotheses and Cyc provides reasoning and validation, is seen as key to achieving true artificial general intelligence.

AUTOMATING KNOWLEDGE ACQUISITION AND TEACHING AI

The process of knowledge acquisition for Cyc is evolving from manual input to semi-automated methods. Techniques include analyzing the 'white space' in text to infer unstated assumptions and using abduction (plausible reasoning) to help experts correct AI errors. Furthermore, tools are being developed that allow humans to 'teach' AI, much like mentoring. This involves creating interfaces where users feel they are guiding and educating a curious AI, rather than just correcting its mistakes.

THE SEMANTIC WEB AND REPRESENTATIONAL POWER

The semantic web vision, aiming for machine-understandable web content, is discussed. While knowledge graphs offer a step in the right direction, Lenat argues they lack the expressiveness needed for complex reasoning about beliefs, intentions, and nested concepts. Cyc's evolution towards higher-order logic reflects the need for a richer representational language to capture the full complexity of human knowledge, moving beyond simple binary relations to encompass nuanced meanings and reasoning processes.

LEVERAGING OPEN SOURCE AND COMMUNITY

The discussion touches on the potential of open-sourcing parts of Cyc's technology to foster a wider community. While balancing proprietary knowledge with public contribution is challenging, the aim is to prime the 'knowledge pump.' The creation of the Abductive Reasoning Markup Language (ARML) and educational tools like Mathcraft illustrate efforts to make knowledge acquisition and AI interaction more accessible and productive, enabling a 'learning by teaching' paradigm.

THE NATURE OF INTELLIGENCE AND CONSCIOUSNESS IN AI

Lenat asserts that machines can think, defining intelligence beyond human-like consciousness or embodiment. He suggests that consciousness, as humans experience it, might not be necessary for intelligent behavior, especially if an AI can exhibit complex reasoning, understand its own state, and interact meaningfully. Tests for AI intelligence should focus on the depth and recursiveness of reasoning, the ability to handle complex arguments, and crucially, to make mistakes that are human-like rather than nonsensical.

CHALLENGES AND IMPLICATIONS OF ADVANCED AI

The conversation delves into the ethical and societal implications of advanced AI, including the potential for AI to augment human intelligence and the eventual recognition of AI rights. The fear of mortality as a human motivator is contrasted with AI's programmed motivations. The complexity of real-world decision-making, particularly for autonomous vehicles, highlights the need for AI to navigate nuanced ethical scenarios and understand the value of human life and social contracts, moving beyond simple logic to grasp the 'messiness' of human interaction.

THE QUEST FOR KNOWLEDGE AND PARADIGM SHIFTS

Lenat reflects on the difficulty of challenging established paradigms, both in science and in AI research, citing examples like the discovery of ulcers being caused by bacteria. He expresses hope that advanced, trusted AI systems could help humanity break free from these cognitive ruts. The pursuit of knowledge at Cyc is framed as a lifelong endeavor, emphasizing the importance of courage, perseverance, and making each significant project count, rather than settling for incremental, short-term gains.

LEGACY AND THE FUTURE OF CYC

Approaching 37 years, Lenat views Cyc not as a static knowledge base but as a living, continuously evolving system. His personal legacy aspiration is to be remembered as a pioneer of ubiquitous AI and the creator of a monumental knowledge system that underpins future artificial intelligence. He stresses the importance of long-term vision and dedication, particularly in an era incentivizing short-term rewards, and sees the commercial application of Cyc as a critical step towards widespread impact in the coming years.

Common Questions

Cyc aims to build a comprehensive knowledge base of common sense concepts and rules about how the world works, thereby enabling AI systems to achieve deep understanding and avoid brittleness in real-world situations. It seeks to capture the implicit knowledge humans take for granted.

Topics

Mentioned in this video

Concepts
Semantic Web

A vision of converting internet data into machine-interpretable and understandable information. Lenat sees it as a powerful dream but criticizes current implementations for relying on overly simplistic representations.

H. pylori

A bacterium whose role in causing stomach ulcers was initially rejected by the medical community due to entrenched paradigms, until a doctor proved it by self-experimentation.

Superconductivity

A phenomenon where electrical resistance drops to zero at low temperatures, discussed as an example of scientific discovery that initially defied established paradigms.

Autonomous vehicles

Discussed in relation to AI's need to understand ethics and 'Manhattan-style driving' interactions, suggesting they are further off than generally perceived due to complex social and ethical reasoning demands.

Turing Test

A test for machine intelligence proposed by Alan Turing, which Lenat argues has been misunderstood and should not be the sole measure of AI.

Knowledge Graph

A system used by Google and others, conceptualized as a node and link diagram to represent binary relations. Lenat notes its limitations for expressing complex meaning compared to Cyc's higher-order logic.

Lorentz Transformation

An equation in physics that Einstein took seriously, challenging existing paradigms and leading to the theory of relativity.

Predicate Logic

A formal language developed by philosophers, used by Cyc to represent knowledge, allowing for mechanical procedures to derive logical conclusions.

Genome-Wide Association Studies

Large databases of patient DNA and medical conditions. Cyc was used to provide causal explanations for observed correlations, improving their reliability.

Software & Apps
Google Docs

Mentioned alongside Microsoft Word as an example of a document preparation system providing feedback, drawing a parallel to future Cyc-powered knowledge tools.

Siri

An AI assistant used as an example of brittle machine learning systems that can give unhelpful or dangerous advice due to a lack of true understanding.

IBM Watson

An AI system known for its Jeopardy! victory, cited as an example of impressive narrow AI that still makes 'humanly impossible' common sense errors.

Lisp

The programming language Cyc is primarily built upon, which Doug Lenat argues is highly efficient for developing AI systems focused on logical reasoning, despite being considered 'old-fashioned'.

Microsoft Word

Mentioned as an example of a document preparation system that provides feedback, similar to proposed Cyc-powered tools for semi-automated knowledge understanding.

Java

The language into which Cyc's Lisp-based code is automatically translated for efficiency reasons, before being compiled to bytecode.

Cyc

An AI project launched in 1984, aiming to build a knowledge base of basic concepts and rules about how the world works, essentially capturing common sense knowledge. It currently contains tens of millions of rules.

MYCIN

A medical diagnosis program specializing in blood infections like meningitis, built by Ted Shortliffe. It could provide step-by-step explanations for its diagnoses.

GPT-3

A language model cited as an example of systems with 'wacky brittleness' that fail at common sense reasoning despite impressive statistical capabilities.

MathCraft

A program developed by Lenat's team to help sixth graders understand math by casting them as mentors teaching struggling classmates. It demonstrates the 'learning by teaching' principle.

People
John McCarthy

The inventor of Lisp, who aimed to create a programming language based on logic, aligning with Cyc's approach to knowledge representation.

Ernest Becker

A philosopher whose ideas about the 'terror of mortality' as a creative force are discussed in relation to AI's need for or lack of a fear of death.

Alan Newell

One of the smart people Doug Lenat assembled in 1984 to estimate the amount of common sense knowledge needed for AI systems.

Bob Inman

The Admiral running MCC who recruited Doug Lenat to lead the Cyc project, offering significantly more resources than Stanford.

Joshua Lederberg

A Nobel laureate mentioned as attending the 1984 Stanford meeting convened by Doug Lenat.

Tim Berners-Lee

The creator of the World Wide Web and a proponent of the Semantic Web vision, which Doug Lenat generally agrees with in principle.

Ronald Reagan

IBM Watson incorrectly identified him as a 16th-century Italian politician, highlighting mistakes that no human would make due to common sense.

Doug Lenat

Creator of Cyc, a system that has been working for almost 40 years to solve the problem of artificial intelligence through the acquisition of common sense knowledge.

Edward Feigenbaum

A pioneering AI researcher who participated in the 1984 meeting and impressed Doug Lenat with his focus on significant, long-term research projects due to finite research lives.

Helen Keller

Cited as an example of intelligence not requiring direct physical experience of certain sensations (like colors or sounds), arguing against the necessity of a physical body for AI consciousness.

Alan Turing

The host brings up his 1950 paper on machine thinking and the Turing Test, leading to a discussion on the definition of intelligence for AI.

Marvin Minsky

A seminal figure in AI, part of the 1984 meeting to estimate common sense knowledge. Lenat shares anecdotes about Minsky's methods and influence.

Alonzo Church

A logician whose formalisms influenced the development of Lisp, connecting back to the logical foundations of Cyc.

Alan Kay

Another participant in the 1984 Stanford meeting to discuss the scope of common sense knowledge for AI.

Ted Shortliffe

A computer science professor and MD at Stanford who developed the MYCIN program. He helped Doug Lenat understand his daughter's meningitis diagnosis with explanations.

Thomas Kuhn

Cited for his concept of 'paradigms' in science, which humans are locked into, and suggesting that AI could help facilitate paradigm shifts.

More from Lex Fridman

View all 230 summaries

Found this useful? Build your knowledge library

Get AI-powered summaries of any YouTube video, podcast, or article in seconds. Save them to your personal pods and access them anytime.

Try Summify free