How does Cyc define 'understanding' for an AI system?

Understanding for Cyc is likened to a layered, solid foundation of knowledge. An AI understands if it can answer 'why' and 'how' questions about its responses, drawing on increasingly general principles, especially in unforeseen or novel situations, much like humans do.

How many pieces of common sense knowledge does Cyc require?

Initial estimates in 1984 suggested around a million pieces of knowledge. However, later empirical work revealed that tens of millions of assertions or rules are needed to adequately capture the common sense knowledge assumed by an average adult.

How does Cyc acquire its knowledge base?

Cyc uses several techniques, including 'reading' text by identifying assumed knowledge (the 'white space' between explicit statements), inferring missing steps between sentences, and introspecting on why contradictory or humorous statements are unbelievable.

Why did Cyc move from global to local consistency in its knowledge base?

The real world and human models contain inherent inconsistencies. Trying to maintain global consistency would 'kill' an enormous knowledge base. Instead, Cyc adopted local consistency, using 'contexts' (like tectonic plates) that are internally consistent but can have inconsistencies at their boundaries, mirroring how humans navigate different realities (e.g., real world vs. cartoons).

How does Cyc's knowledge representation differ from a tree structure?

Cyc represents knowledge as a graph, not a tree. This allows concepts to have multiple parent nodes, reflecting real-world complexity (e.g., a table is wooden, black, and flat). A tree structure, like that used in the failed Japanese Fifth Generation project, limits expressiveness by only allowing one parent.

What is the vision for AI's impact on human intelligence?

Doug Lenat believes that general AI will augment human intelligence, making individuals and humanity smarter. This is seen as a leap comparable to the development of language, enabling greater creativity, parallel problem-solving, and reduced misunderstanding.

How can humans help Cyc expand its knowledge base?

Cyc aims to develop user-friendly knowledge editing tools that allow experts to 'teach' the system new assertions. This happens by correcting wrong answers, with the system suggesting plausible alternatives, reducing the mental effort for human teachers and making knowledge acquisition more accessible.

How does Cyc integrate with machine learning techniques?

Cyc works synergistically with machine learning. Machine learning is used for quick pattern recognition and statistical inference (right brain), while Cyc provides deep, slow, causal reasoning (left brain). This combination is used to validate spurious correlations, generate causal explanations, and make testable predictions in fields like medical research.

What were the original goals and subsequent challenges of the Semantic Web?

The Semantic Web's dream was to make internet data machine-understandable. While Google's Knowledge Graph made initial 'baby steps' with simple binary relations, Lenat argues that fully realizing the dream requires more expressive logical languages like Cyc's higher-order logic to represent complex, nested meanings.

What is the most beautiful idea Doug Lenat has learned from working on Cyc?

The most beautiful idea for Lenat is the feeling that each piece of knowledge taught to Cyc is a singular act of creation—something that no one will ever have to teach a computer again. It's like painting a masterpiece once, with permanent impact, rather than a repetitive performance.

Does Cyc deal with human emotions and motivations?

Yes, understanding human emotions and motivations is crucial for Cyc. It's essential for generating plausible scenarios (e.g., terrorist threats, career planning) where human behavior and reactions are key linkages in cause-and-effect chains. Concepts like 'love' are broken down into 50-60 specific types with idiosyncratic properties.

What is Cyc's programming language and architecture?

Cyc's core is written in a dialect of Lisp called 'SubL'. While largely Lisp-based, for efficiency, much of the code is automatically translated into Java. The system separates epistemological (what the system knows) from heuristic (how it reasons efficiently) problems, using over a thousand heuristic modules.

What is the Knowledge Axiomatization Institute (KAI)?

KAI is a non-profit company founded by Lenat to train individuals in ontological engineering. It aims to identify latent talent for this work, regardless of formal education, and provide scholarships and job placement, creating opportunities for people to contribute to AI knowledge bases.

Is consciousness necessary for general artificial intelligence?

Lenat argues that a physical body is not necessary for intelligence, citing Helen Keller. He suggests that if an AI can exhibit all the responses expected of a conscious entity—understanding its task, goals, history—then it deserves the label of consciousness, without needing a mystical subjective experience.

Key Moments

Douglas Lenat: Cyc and the Quest to Solve Common Sense Reasoning in AI | Lex Fridman Podcast #221

Lex Fridman

Science & Technology5 min read173 min video

Sep 15, 2021|137,339 views|2,399|275

agi ai ai podcast artificial intelligence artificial intelligence podcast computer science cyc cycorp douglas lenat lex ai lex fridman lex jre

Save to Pod

Want to know something specific about what's covered?

We've already dissected every moment. Ask and we will deliver (with timestamps).

Key Moments

On this page

TL;DR

Douglas Lenat discusses Cyc, a 37-year AI project for common sense reasoning, and the future of AI.

Key Insights

Common sense knowledge is crucial for AI, going beyond pattern recognition to true understanding.

Cyc aims to capture vast amounts of common sense knowledge, estimated at tens of millions of assertions.

AI needs both quick pattern matching (right brain) and deep, slow reasoning (left brain) for general intelligence.

Context and local consistency are vital for managing vast knowledge bases, moving away from strict global consistency.

AI can augment human intelligence by improving critical thinking and problem-solving capabilities.

The development of AI may lead to profound societal shifts, potentially granting rights to advanced AI systems.

THE CHALLENGE OF COMMON SENSE IN AI

Douglas Lenat introduces Cyc, a project initiated in 1984 to address the fundamental challenge of imbuing AI with common sense and general world knowledge. Existing AI systems, despite early successes, often fail due to a lack of this foundational understanding, analogous to a dog performing tricks without comprehending their meaning. True understanding, Lenat likens to the ground beneath our feet – a deep, layered foundation that allows for robust decision-making, especially in novel or unexpected situations, like navigating unexpected road hazards.

REPRESENTING AND UTILIZING KNOWLEDGE

The core of Cyc's mission involves representing common sense knowledge in a format that computers can use for reasoning. This involves translating human understanding, often implicit, into formal logical structures. Initial estimates suggested around a million pieces of common sense knowledge were needed, but experience at Cyc revealed the necessity for tens of millions of assertions. This knowledge isn't just about facts but includes rules of thumb, contextual understanding, and the ability to infer unstated information, crucial for understanding nuance in language and situations.

THE EVOLUTION OF CYC AND ITS SCALE

The Cyc project began with an ambitious goal, estimated in person-centuries of effort, significantly bolstered by initiatives like Japan's Fifth Generation Computer Project and subsequent US government funding. Over decades, Cyc has accumulated tens of millions of rules and assertions. While initially focused on broad common sense, recent efforts have shifted towards applying this knowledge to domain-specific applications in healthcare and industry, creating expert systems that are no longer brittle but grounded in a vast, underlying layer of common sense.

HANDLING EXCEPTIONS AND CONTEXTS

A significant challenge overcome by Cyc was the need to abandon strict global consistency. Recognizing that the real world is rife with exceptions and context-dependent truths, Cyc adopts a system of local consistency, akin to tectonic plates. Each context, or 'plate,' strives for internal consistency, but inconsistencies are allowed at boundaries. This allows Cyc to represent knowledge that is true in one situation (e.g., normal physics) but false in another (e.g., a cartoon world), and to handle nuances of belief, time, and abstraction.

SYNERGY BETWEEN SYMBOLIC AI AND MACHINE LEARNING

Lenat emphasizes the complementary strengths of symbolic AI (like Cyc, representing deep reasoning) and machine learning (representing quick pattern recognition). He uses the analogy of left and right brain hemispheres. While machine learning excels at identifying correlations, Cyc provides causal explanations and testable predictions, as demonstrated in a project with genomic data for disease correlation. This synergy, where machine learning generates hypotheses and Cyc provides reasoning and validation, is seen as key to achieving true artificial general intelligence.

AUTOMATING KNOWLEDGE ACQUISITION AND TEACHING AI

The process of knowledge acquisition for Cyc is evolving from manual input to semi-automated methods. Techniques include analyzing the 'white space' in text to infer unstated assumptions and using abduction (plausible reasoning) to help experts correct AI errors. Furthermore, tools are being developed that allow humans to 'teach' AI, much like mentoring. This involves creating interfaces where users feel they are guiding and educating a curious AI, rather than just correcting its mistakes.

THE SEMANTIC WEB AND REPRESENTATIONAL POWER

The semantic web vision, aiming for machine-understandable web content, is discussed. While knowledge graphs offer a step in the right direction, Lenat argues they lack the expressiveness needed for complex reasoning about beliefs, intentions, and nested concepts. Cyc's evolution towards higher-order logic reflects the need for a richer representational language to capture the full complexity of human knowledge, moving beyond simple binary relations to encompass nuanced meanings and reasoning processes.

LEVERAGING OPEN SOURCE AND COMMUNITY

The discussion touches on the potential of open-sourcing parts of Cyc's technology to foster a wider community. While balancing proprietary knowledge with public contribution is challenging, the aim is to prime the 'knowledge pump.' The creation of the Abductive Reasoning Markup Language (ARML) and educational tools like Mathcraft illustrate efforts to make knowledge acquisition and AI interaction more accessible and productive, enabling a 'learning by teaching' paradigm.

THE NATURE OF INTELLIGENCE AND CONSCIOUSNESS IN AI

Lenat asserts that machines can think, defining intelligence beyond human-like consciousness or embodiment. He suggests that consciousness, as humans experience it, might not be necessary for intelligent behavior, especially if an AI can exhibit complex reasoning, understand its own state, and interact meaningfully. Tests for AI intelligence should focus on the depth and recursiveness of reasoning, the ability to handle complex arguments, and crucially, to make mistakes that are human-like rather than nonsensical.

CHALLENGES AND IMPLICATIONS OF ADVANCED AI

The conversation delves into the ethical and societal implications of advanced AI, including the potential for AI to augment human intelligence and the eventual recognition of AI rights. The fear of mortality as a human motivator is contrasted with AI's programmed motivations. The complexity of real-world decision-making, particularly for autonomous vehicles, highlights the need for AI to navigate nuanced ethical scenarios and understand the value of human life and social contracts, moving beyond simple logic to grasp the 'messiness' of human interaction.

THE QUEST FOR KNOWLEDGE AND PARADIGM SHIFTS

Lenat reflects on the difficulty of challenging established paradigms, both in science and in AI research, citing examples like the discovery of ulcers being caused by bacteria. He expresses hope that advanced, trusted AI systems could help humanity break free from these cognitive ruts. The pursuit of knowledge at Cyc is framed as a lifelong endeavor, emphasizing the importance of courage, perseverance, and making each significant project count, rather than settling for incremental, short-term gains.

LEGACY AND THE FUTURE OF CYC

Approaching 37 years, Lenat views Cyc not as a static knowledge base but as a living, continuously evolving system. His personal legacy aspiration is to be remembered as a pioneer of ubiquitous AI and the creator of a monumental knowledge system that underpins future artificial intelligence. He stresses the importance of long-term vision and dedication, particularly in an era incentivizing short-term rewards, and sees the commercial application of Cyc as a critical step towards widespread impact in the coming years.

Mentioned in This Episode

●Products

●Software & Apps

●Companies

●Organizations

●Books

●Studies Cited

●Concepts

●People Referenced

Common Questions

Cyc aims to build a comprehensive knowledge base of common sense concepts and rules about how the world works, thereby enabling AI systems to achieve deep understanding and avoid brittleness in real-world situations. It seeks to capture the implicit knowledge humans take for granted.

Topics

Ai-Ethics Mindset & Self-Improvement AI & Machine Learning Society & Philosophy Common Sense Reasoning Human-AI Collaboration Symbolic AI Future Of Intelligence Knowledge Representation Automated Reasoning Ontological Engineering

Mentioned in this video

Media

Road Runner

Used as an example of a 'context' where physical laws and concepts like life and death differ significantly from the real world, illustrating Cyc's multi-contextual knowledge representation.

Star Trek

Referenced by Doug Lenat to illustrate the historical fear of inconsistent computer systems, contrasting with Cyc's necessary acceptance of local inconsistencies.

Locations

Bletchley Park

The location where Alan Turing recruited code-breakers during WWII using cryptic crossword puzzles.

Afghanistan

Mentioned in a hypothetical political scenario to illustrate the complexity of reasoning about nested beliefs and intentions.

Geneva, Switzerland

Used in a travel website example; home to the International Red Cross Museum, which an AI could suggest to a nurse client based on common sense.

Australia

The country where a doctor discovered H. pylori's role in stomach ulcers, resisting the prevailing paradigm.

Concepts

Semantic Web

A vision of converting internet data into machine-interpretable and understandable information. Lenat sees it as a powerful dream but criticizes current implementations for relying on overly simplistic representations.

H. pylori

A bacterium whose role in causing stomach ulcers was initially rejected by the medical community due to entrenched paradigms, until a doctor proved it by self-experimentation.

Superconductivity

A phenomenon where electrical resistance drops to zero at low temperatures, discussed as an example of scientific discovery that initially defied established paradigms.

Autonomous vehicles

Discussed in relation to AI's need to understand ethics and 'Manhattan-style driving' interactions, suggesting they are further off than generally perceived due to complex social and ethical reasoning demands.

Turing Test

A test for machine intelligence proposed by Alan Turing, which Lenat argues has been misunderstood and should not be the sole measure of AI.

Knowledge Graph

A system used by Google and others, conceptualized as a node and link diagram to represent binary relations. Lenat notes its limitations for expressing complex meaning compared to Cyc's higher-order logic.

Lorentz Transformation

An equation in physics that Einstein took seriously, challenging existing paradigms and leading to the theory of relativity.

Predicate Logic

A formal language developed by philosophers, used by Cyc to represent knowledge, allowing for mechanical procedures to derive logical conclusions.

Genome-Wide Association Studies

Large databases of patient DNA and medical conditions. Cyc was used to provide causal explanations for observed correlations, improving their reliability.

Organizations

Taliban

Mentioned in a hypothetical political scenario to illustrate the complexity of reasoning about nested beliefs and intentions.

Cleveland Clinic

An institution involved in the NIH project with Cyc, utilizing Cyc's reasoning to validate correlations from genome-wide association studies.

National Institutes of Health

Partnered with Cyc on a project related to genome-wide association studies and medical research, demonstrating synergy between Cyc and machine learning.

Microelectronics and Computer Technology Corporation

One of the first research consortia formed in the US under the NCRA, which Doug Lenat joined as its principal scientist.

International Red Cross

Its museum in Geneva is used in a hypothetical travel website example to show how Cyc could make relevant recommendations by inferring connections based on common sense (nurse -> Red Cross).

Knowledge Axiomatization Institute

A not-for-profit company founded by Doug Lenat to identify and train individuals with latent talent for ontological engineering, regardless of formal education.

Stanford University

The institution where Doug Lenat was a faculty member in the computer science department before founding Cyc.

Wikipedia

A website mentioned as an example of accessible information on the internet. However, its 'facts' are considered projections of deeper, unstated common sense rules by Lenat.

Software & Apps

Google Docs

Mentioned alongside Microsoft Word as an example of a document preparation system providing feedback, drawing a parallel to future Cyc-powered knowledge tools.

Siri

An AI assistant used as an example of brittle machine learning systems that can give unhelpful or dangerous advice due to a lack of true understanding.

IBM Watson

An AI system known for its Jeopardy! victory, cited as an example of impressive narrow AI that still makes 'humanly impossible' common sense errors.

Lisp

The programming language Cyc is primarily built upon, which Doug Lenat argues is highly efficient for developing AI systems focused on logical reasoning, despite being considered 'old-fashioned'.

Microsoft Word

Mentioned as an example of a document preparation system that provides feedback, similar to proposed Cyc-powered tools for semi-automated knowledge understanding.

Java

The language into which Cyc's Lisp-based code is automatically translated for efficiency reasons, before being compiled to bytecode.

Cyc

An AI project launched in 1984, aiming to build a knowledge base of basic concepts and rules about how the world works, essentially capturing common sense knowledge. It currently contains tens of millions of rules.

MYCIN

A medical diagnosis program specializing in blood infections like meningitis, built by Ted Shortliffe. It could provide step-by-step explanations for its diagnoses.

GPT-3

A language model cited as an example of systems with 'wacky brittleness' that fail at common sense reasoning despite impressive statistical capabilities.

MathCraft

A program developed by Lenat's team to help sixth graders understand math by casting them as mentors teaching struggling classmates. It demonstrates the 'learning by teaching' principle.

People

John McCarthy

The inventor of Lisp, who aimed to create a programming language based on logic, aligning with Cyc's approach to knowledge representation.

Ernest Becker

A philosopher whose ideas about the 'terror of mortality' as a creative force are discussed in relation to AI's need for or lack of a fear of death.

Alan Newell

One of the smart people Doug Lenat assembled in 1984 to estimate the amount of common sense knowledge needed for AI systems.

Bob Inman

The Admiral running MCC who recruited Doug Lenat to lead the Cyc project, offering significantly more resources than Stanford.

Joshua Lederberg

A Nobel laureate mentioned as attending the 1984 Stanford meeting convened by Doug Lenat.

Tim Berners-Lee

The creator of the World Wide Web and a proponent of the Semantic Web vision, which Doug Lenat generally agrees with in principle.

Ronald Reagan

IBM Watson incorrectly identified him as a 16th-century Italian politician, highlighting mistakes that no human would make due to common sense.

Doug Lenat

Creator of Cyc, a system that has been working for almost 40 years to solve the problem of artificial intelligence through the acquisition of common sense knowledge.

Edward Feigenbaum

A pioneering AI researcher who participated in the 1984 meeting and impressed Doug Lenat with his focus on significant, long-term research projects due to finite research lives.

Helen Keller

Cited as an example of intelligence not requiring direct physical experience of certain sensations (like colors or sounds), arguing against the necessity of a physical body for AI consciousness.

Alan Turing

The host brings up his 1950 paper on machine thinking and the Turing Test, leading to a discussion on the definition of intelligence for AI.

Marvin Minsky

A seminal figure in AI, part of the 1984 meeting to estimate common sense knowledge. Lenat shares anecdotes about Minsky's methods and influence.

Alonzo Church

A logician whose formalisms influenced the development of Lisp, connecting back to the logical foundations of Cyc.

Alan Kay

Another participant in the 1984 Stanford meeting to discuss the scope of common sense knowledge for AI.

Ted Shortliffe

A computer science professor and MD at Stanford who developed the MYCIN program. He helped Doug Lenat understand his daughter's meningitis diagnosis with explanations.

Thomas Kuhn

Cited for his concept of 'paradigms' in science, which humans are locked into, and suggesting that AI could help facilitate paradigm shifts.

Books

Romeo and Juliet

Shakespearean play used as an example to illustrate the complexity of human reasoning about nested beliefs (e.g., 'what Juliet thought Romeo would think she believed').

London Times

The newspaper mentioned where Alan Turing would publish cryptic crossword puzzles to find talented individuals for Bletchley Park during WWII.

Legislation & Policy

National Cooperative Research Act

US legislation passed in response to the Japanese Fifth Generation effort, allowing American companies to collaborate on R&D without antitrust violations.

Companies

Google

Mentioned in the context of search engines and their 'Knowledge Graph' which provides surface-level information but lacks deep understanding.

Products

Tesla Autopilot

Lex Fridman mentions enjoying how Tesla Autopilot gives a sense of helping a machine learn, relating to the joy of teaching for humans.

Ask anything from this episode.

Save it, chat with it, and connect it to Claude or ChatGPT. Get cited answers from the actual content — and build your own knowledge base of every podcast and video you care about.

Get Started Free