Anca Dragan: Human-Robot Interaction and Reward Engineering | Lex Fridman Podcast #81
Professor Anca Dragan discusses human-robot interaction, reward engineering, and understanding human behavior.
Key Insights
Human-robot interaction requires robots to understand and adapt to human behavior, not just perform tasks in isolation.
Developing robots that can express emotions or intentions is challenging but crucial for deeper human connection.
Understanding human preferences is key, often approached through inverse reinforcement learning, but more complex models are needed.
Robots can actively gather information about human intentions and preferences through their own actions.
The design of effective reward functions for robots is complex and often requires continuous learning and adaptation.
Human behavior that appears irrational can often be understood by considering people's differing assumptions, beliefs, and computational constraints.
THE EVOLUTION OF ROBOTICS AND HUMAN CONNECTION
Anca Dragan's journey into robotics began gradually, evolving from programming and math into applied AI, and she eventually found her true passion in robotics at Carnegie Mellon. Her initial work focused on manipulation, but transformative experiences, like riding in a self-driving car and interacting with Boston Dynamics' Spot Mini, highlighted the potential for robots to foster deeper human connections beyond mere task execution. This shift in perspective emphasizes how robots appear and behave in relation to humans, moving beyond purely functional interactions.
THE CHALLENGE OF MODELING HUMAN BEHAVIOR
A central theme is the difficulty of accurately modeling human behavior for robots. This challenge is twofold: predicting human actions and satisfying human preferences. While traditional approaches like inverse reinforcement learning offer a starting point by inferring reward functions from observed behavior, they often rely on simplified models of rationality. Dragan points out that human actions can seem irrational because people operate under different assumptions, beliefs, or computational constraints, necessitating more sophisticated models that account for this complexity.
INVERSE REINFORCEMENT LEARNING AND BEYOND
Inverse Reinforcement Learning (IRL) is presented as a powerful tool for understanding human preferences by inferring what rewards drive their actions. This is based on the economic principle of utility maximization, with extensions like Boltzmann rationality accounting for human noise and stochasticity. However, Dragan notes that even these models struggle with certain complex tasks, like controlling a lunar lander or a robot arm, where human behavior deviates significantly from simple rational models, indicating the need for further advancements beyond current probabilistic approaches.
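The Boltzmann-rational observation model discussed above can be sketched in a few lines. This is an illustrative toy, not code from the episode: two hypothetical candidate reward functions are represented by their Q-values over actions, the human is assumed to pick actions with probability proportional to the exponentiated value (softmax), and Bayes' rule infers which candidate reward best explains an observed action.

```python
import math

def boltzmann_policy(q_values, beta=1.0):
    """Softmax ("Boltzmann-rational") action distribution over Q-values.

    Higher beta = closer to perfectly rational; beta -> 0 = uniform noise.
    """
    exps = [math.exp(beta * q) for q in q_values]
    z = sum(exps)
    return [e / z for e in exps]

def posterior_over_rewards(q_tables, observed_action, prior, beta=1.0):
    """Bayesian update over candidate reward parameters given one observed action.

    q_tables: one list of Q-values per candidate reward hypothesis.
    """
    likelihoods = [boltzmann_policy(q, beta)[observed_action] for q in q_tables]
    unnorm = [lik * p for lik, p in zip(likelihoods, prior)]
    z = sum(unnorm)
    return [u / z for u in unnorm]

# Toy example: hypothesis 0 rewards action 0; hypothesis 1 rewards action 1.
q_tables = [[2.0, 0.0], [0.0, 2.0]]
prior = [0.5, 0.5]
# The human chose action 1, so the posterior should favor hypothesis 1.
post = posterior_over_rewards(q_tables, observed_action=1, prior=prior, beta=2.0)
```

With a higher `beta` the same observation shifts the posterior more sharply; with `beta` near zero the observation is nearly uninformative, which is one way noisy human behavior limits what IRL can infer.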
INFORMING ROBOTS THROUGH HUMAN INTERACTION
Robots need not be passive observers; they can actively influence and gather information about human preferences. This involves a collaborative approach where robots take actions to solicit informative responses and refine their understanding. For instance, an autonomous car can change its actions to observe how other drivers react, revealing their driving styles. This concept frames human-robot interaction as a game-theoretic problem where both agents influence each other, rather than just the robot reacting to static human behavior.
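The active information-gathering idea can be sketched as an expected-entropy comparison. Everything here is invented for illustration (the probe actions, the two driver "styles", and the response probabilities are not from the episode): the robot scores each candidate probe action by how much it is expected to shrink the robot's uncertainty about the other driver.

```python
import math

def entropy(p):
    """Shannon entropy (nats) of a discrete distribution."""
    return -sum(x * math.log(x) for x in p if x > 0)

def expected_posterior_entropy(belief, response_model):
    """Expected entropy of the belief after observing the human's response.

    response_model[style][response] = P(response | style, this probe action).
    """
    n_responses = len(response_model[0])
    total = 0.0
    for r in range(n_responses):
        joint = [belief[s] * response_model[s][r] for s in range(len(belief))]
        p_r = sum(joint)  # marginal probability of seeing response r
        if p_r == 0:
            continue
        posterior = [j / p_r for j in joint]
        total += p_r * entropy(posterior)
    return total

# Belief over two hypothetical driver styles: [aggressive, defensive].
belief = [0.5, 0.5]
# Probe A (nudge toward their lane): the styles respond very differently.
probe_nudge = [[0.9, 0.1],   # aggressive driver mostly keeps speed
               [0.1, 0.9]]   # defensive driver mostly brakes
# Probe B (just wait): both styles respond identically -> uninformative.
probe_wait = [[0.5, 0.5],
              [0.5, 0.5]]
h_nudge = expected_posterior_entropy(belief, probe_nudge)
h_wait = expected_posterior_entropy(belief, probe_wait)
```

An information-gathering robot would prefer the nudge here because its expected posterior entropy is lower; a full treatment would trade this off against the cost and safety of the probe action itself.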
THE COMPLEXITY OF REWARD FUNCTION DESIGN
Designing effective reward functions for robots is a significant hurdle, even outside the context of human interaction. Dragan highlights that simply specifying a reward function doesn't guarantee desirable behavior in all situations, due to factors like Goodhart's Law: once a proxy measure becomes the optimization target, it ceases to track the outcome it was meant to capture. This motivates reward learning, where humans provide implicit signals, such as physical interventions or emergency stops, that the robot interprets to refine its understanding of desired outcomes and preferences.
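One hedged sketch of learning from an implicit signal: treat a human's physical correction as evidence that the corrected trajectory scores higher under the true reward, and nudge the reward weights toward its features. The feature names and numbers below are invented for illustration, not taken from the episode.

```python
def update_weights(weights, phi_original, phi_corrected, alpha=0.1):
    """Shift linear reward weights toward the features the correction favored.

    phi_*: feature vectors of the robot's planned trajectory and of the
    trajectory the human pushed the robot onto; alpha is a learning rate.
    """
    return [w + alpha * (pc - po)
            for w, po, pc in zip(weights, phi_original, phi_corrected)]

# Hypothetical features: [efficiency, distance_from_human]. The robot
# initially cares only about efficiency; the human pushes the arm onto a
# slightly slower path that stays farther from them.
weights = [1.0, 0.0]
phi_original = [0.9, 0.2]
phi_corrected = [0.7, 0.8]
new_w = update_weights(weights, phi_original, phi_corrected)
```

After the update the robot weights distance-from-human more and raw efficiency slightly less, so future plans drift toward what the intervention implied, without the human ever writing down a reward function.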
HUMAN-ROBOT INTERACTION AS AN UNDER-ACTUATED PROBLEM
Dragan draws an analogy between human-robot interaction and under-actuated systems in robotics, where not all degrees of freedom can be directly controlled. Humans, situated in a shared environment with robots, are not static entities but can influence the robot's behavior. This perspective suggests that robots should aim to influence human actions subtly, much like in a dance, rather than attempting direct control. The goal is to empower both the human and the robot to achieve better outcomes collaboratively by understanding and leveraging these limited degrees of influence.
THE ROLE OF SIMULATION AND DATA IN LEARNING
Simulation plays a vital role in training robots for human interaction, allowing for the development and testing of models in various scenarios. However, Dragan emphasizes that relying solely on data can be problematic, especially when encountering out-of-distribution situations. The key lies in a combination of learned models and human expertise, incorporating priors and inductive biases to ensure generalization. This approach acknowledges that humans possess reasoning capabilities, including common sense and an understanding of physics, which current data-driven methods often struggle to replicate.
ADDRESSING THE HUMAN ELEMENT IN AUTONOMOUS SYSTEMS
The presence of humans significantly complicates tasks like driving. While perception issues are becoming manageable, human behavior introduces unpredictable factors that are orders of magnitude more difficult to solve than purely mechanical challenges. This is evident in the ongoing development of autonomous vehicles, where extensive engineering and algorithmic adjustments are required to navigate complex urban environments safely and effectively, highlighting the deep challenge human interaction poses to AI systems.
THE ETHICS OF HUMAN-ROBOT INTERACTION
The discussion touches on the ethical implications of human-robot relationships, referencing Asimov's Three Laws as a primitive framework. Dragan argues that a rigid, rule-based system is inadequate; instead, robots should be designed to continuously learn and adapt their understanding of human intentions and preferences. The idea of 'leaked' information from human behavior and environmental context offers clues to desired robot actions, suggesting a future where robots are more attuned to human needs and nuances through ongoing interaction and learning.
THE MEANING OF LIFE AND ROBOT REWARD FUNCTIONS
Contemplating the meaning of life, Dragan suggests that impacting our immediate communities and being present for others is paramount, given the vastness of the universe. This perspective aligns with the ongoing challenge in robotics: defining reward functions that capture human values and fulfillment. The finite nature of existence, a source of beauty and meaning for humans, also presents a profound lesson for AI, emphasizing the need for robots to operate within constraints and understand context, rather than blindly optimizing abstract goals.
Common Questions
How did Anca Dragan get into robotics?
Anca Dragan initially focused on programming and math, then AI. Her entry into robotics was somewhat accidental when she joined Carnegie Mellon's Robotics Institute. A pivotal moment was riding in a Google self-driving car in 2014, which profoundly influenced her trajectory toward autonomous systems and human-robot interaction.
Mentioned in this video
A finance app that allows users to send money, buy Bitcoin, and invest in the stock market. It supports fractional share trading and is a sponsor of the podcast.
An AI assistant that children interact with, often rudely, highlighting a challenge in human-robot interaction regarding social development and robot expressiveness.
Tesla's semi-autonomous driving system, mentioned in the context of level 2 autonomy and the human supervisory role.
Anca Dragan's favorite fictional robot due to its amazing expressiveness and movement; her husband even proposed by building an actuated WALL-E robot.
A challenging Atari game used as an example where human operators struggle, making it difficult for models assuming simple rationality to provide assistance.
A TV show that explores philosophical concepts like life, death, and the afterlife, resonating with discussions on human mortality and the meaning of existence.
A robot mentioned as an example of a system where specific animations were handcrafted to achieve expressivity in a narrow setting.
An autonomous vehicle company where Anca Dragan consults, although she emphasizes her UC Berkeley role in this conversation.
Anca Dragan recounts a transformative experience riding in a Google self-driving car in 2014, which inspired her application in autonomous vehicles.
The company behind the Spot Mini robot, known for its advanced robotics engineering.
The animation studio behind the film 'WALL-E', praised for its ability to create expressive and emotional animated characters.
Used as an example of a system where user choices for content selection are interpreted by AI, despite users actively learning about their own preferences.
A mathematician and polymath foundational to utility maximization theory in economics.
An economist who collaborated with John von Neumann on utility maximization theory.
Pioneer in behavioral economics, who suggested that people's choices might be noisy and approximate, evolving utility maximization.
A cognitive scientist whose work on intuitive physics in cognitive science is referenced in the context of modeling human worldviews.
A cognitive scientist also known for studying intuitive physics, related to understanding human assumptions about the world.
A legendary chip architect who previously led the Autopilot team and holds an intuition that driving is a ballistic problem, downplaying the human element.
A professor at UC Berkeley working on human-robot interaction algorithms, focusing on generating robot behavior that accounts for interaction and coordination with humans. She also consults at Waymo.
Anca Dragan's high school physics teacher and mentor who tutored her for free and encouraged her to apply to colleges abroad, significantly impacting her career path.
Contributed to the understanding of probabilistic choices in human behavior, aligning with noisy utility maximization.
The CEO of Tesla, whose statement about lidar being a 'crutch' is mentioned, sparking discussion about innovation versus sticking to existing solutions.
A prominent AI researcher and collaborator with Anca Dragan, who advocates for interpreting reward functions as good evidence of human preference, rather than rigid specifications.
Co-author of 'Artificial Intelligence: A Modern Approach', a highly influential textbook in Anca Dragan's early career.
A collaborator mentioned by Anca Dragan; they interpret designer-specified rewards as evidence of human preference rather than universal laws.
Author known for his Three Laws of Robotics, which are discussed in the context of universal ethical guidelines for AI, and for a quote about challenging assumptions.
A field that emerged in the 1970s, arguing that people are not purely rational but are messy, emotional, and use heuristics.
An observation that when a measure becomes a target, it ceases to be a good measure, applicable to reward function design in AI.
An extension of the rationality assumption in inverse reinforcement learning: humans are modeled as noisily rational, choosing higher-value options with higher probability rather than always acting optimally, which leads to stochastic choices.
A method used in robotics to infer a reward function from observed human behavior, assuming humans act optimally with respect to their preferences.
A mathematical formalization for how robots can update their beliefs about human intentions and parameters based on new evidence from human actions.