Devon
An AI agent that Arena is considering evaluating and integrating through partnerships, noted for its capabilities.
Videos Mentioning Devon

Beating OpenAI and Anthropic by Looking At Data: the new #1 on SWE-Bench w/ W&B CTO Shawn Lewis
Latent Space
An AI agent that can perform complex tasks, acknowledged for its impressive UI and potential, especially with upcoming model improvements.

E170: Tech's Vibe Shift, TikTok ban debate, Vertical AI boom, Florida bans lab-grown meat & more
All-In Podcast
An AI software engineer tool from Cognition AI, demonstrated fixing bugs, fine-tuning models, and building apps end-to-end, outperforming generic LLMs on coding benchmarks.
![The State of AI Startups in 2024 [LS Live @ NeurIPS]](https://i.ytimg.com/vi/HM1d7kMebEI/maxresdefault.jpg)
The State of AI Startups in 2024 [LS Live @ NeurIPS]
Latent Space
An AI agent that offers code execution capabilities for a monthly fee.

The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic
Latent Space
Mentioned as an example of an agent framework that allows users to edit the plan as it goes along.
![[Paper Club] SWE-Bench [OpenAI Verified/Multimodal] + MLE-Bench with Jesse Hu](https://i.ytimg.com/vi/ULcwHlxfSkQ/maxresdefault.jpg)
[Paper Club] SWE-Bench [OpenAI Verified/Multimodal] + MLE-Bench with Jesse Hu
Latent Space
An agent mentioned in the context of SWE-Bench leaderboards and evaluations, distinguished from basic prompting or scaffolding.

Language Agents: From Reasoning to Acting — with Shunyu Yao of OpenAI, Harrison Chase of LangGraph
Latent Space
An AI startup focused on coding agents, highlighted for its user experience and agent-computer interface design.

India’s Fastest Growing AI Startup
Y Combinator
A coding agent that was just getting started around the time Emergent was developing its own agents. It is mentioned as part of the early AI landscape.

DeepWiki: The GitHub Encyclopedia
Latent Space
An AI coding tool developed by Cognition, which the hosts use casually.
![[State of Evals] LMArena's $1.7B Vision — Anastasios Angelopoulos, LMArena](https://i.ytimg.com/vi/NBnOk0Uy9ig/maxresdefault.jpg)
[State of Evals] LMArena's $1.7B Vision — Anastasios Angelopoulos, LMArena
Latent Space
An AI agent that Arena is considering evaluating and integrating through partnerships, noted for its capabilities.

10 People + AI = Billion Dollar Company?
Y Combinator

ChatGPT Codex: The Missing Manual
Latent Space
Mentioned as an alternative AI coding agent that focuses on multi-shot human feedback, contrasting with Codex's one-shot approach.

Claude Code: Anthropic's CLI Agent
Latent Space
A tool mentioned in comparison to Claude Code.

The #1 SWE-Bench Verified Agent
Latent Space
A competitor in the enterprise coding agent space, mentioned as part of a landscape analysis.

Malcolm Gladwell: Working From Home Is Destroying Us! | E162
The Diary Of A CEO
The rural area in the UK where Steven Bartlett grew up; his parents advised him and his siblings to leave to find opportunities.

Why is everyone cloning Deep Research?
Latent Space
An agent mentioned by a host as a preferred user experience model, where the plan is visible and can be updated interactively during execution.