High Agency Pydantic over VC Backed Frameworks — with Jason Liu of Instructor
Key Moments
Instructor turns LLM outputs into structured data, simplifying AI development.
Key Insights
Instructor, a Python SDK, simplifies obtaining structured data from LLMs, moving beyond simple string outputs.
The framework emphasizes a 'requests'-like philosophy, aiming for widespread adoption and ease of use.
Instructor supports various use cases including data extraction, knowledge graph generation, and query understanding.
While function calling offers precise schema definition, JSON mode can be more cost-effective for simpler outputs.
The AI engineering landscape is shifting towards enabling motivated software engineers, rather than solely relying on traditional ML expertise.
Jason Liu's consulting approach with Instructor prioritizes solving interesting problems over building a venture-backed company.
FROM STRING OUTPUTS TO STRUCTURED DATA
Jason Liu, creator of Instructor, discusses the fundamental shift LLMs represent: moving from raw string outputs to structured data. Instructor acts as a Python SDK that wraps OpenAI's SDK, focusing on providing typed responses. This allows developers to work with data structures, opening up possibilities for complex applications like data extraction, knowledge graph generation, and sophisticated query understanding, akin to solving LeetCode problems with LLM outputs.
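The pattern can be sketched in a few lines. The model and field names below are hypothetical, and the Instructor call itself is left in comments since it needs an API key; the runnable part just validates a sample payload the way Instructor would:

```python
from pydantic import BaseModel

# The response model is ordinary Pydantic: the "type" the LLM must return.
class UserDetail(BaseModel):
    name: str
    age: int

# With Instructor, the patched OpenAI client accepts response_model directly
# (sketch only -- requires an API key and the instructor package):
#
#   import instructor
#   from openai import OpenAI
#   client = instructor.from_openai(OpenAI())
#   user = client.chat.completions.create(
#       model="gpt-4o-mini",
#       response_model=UserDetail,
#       messages=[{"role": "user", "content": "Jason is 30 years old."}],
#   )

# Locally, the same model validates whatever JSON the LLM produced:
user = UserDetail(name="Jason", age=30)
print(user.age)  # a typed int, not a string to re-parse
```

The payoff is that downstream code receives a validated object rather than a string, so type checkers and IDEs can reason about it.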
THE 'REQUESTS' PHILOSOPHY FOR ADOPTION
Inspired by the simplicity and ubiquity of the 'requests' library in Python for HTTP calls, Liu has adopted a similar philosophy for Instructor. The goal is for Instructor to become a standard, almost built-in tool in the LLM development ecosystem. This approach prioritizes developer experience and ease of integration, aiming for a state where developers naturally opt for Instructor without overthinking it, much like they do with 'requests'.
NAVIGATING FUNCTION CALLING AND JSON MODE
The conversation delves into the nuances of structured output generation. While function calling offers robust schema definition and validation, allowing for complex relationships and constraints, JSON mode provides a simpler and potentially more cost-effective way to get JSON output. Liu highlights that function calling shines when precise schema definition is crucial, whereas JSON mode might suffice for less critical or simpler output structures.
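The difference is visible in what each mode transmits. A Pydantic model compiles to a full JSON Schema, constraints included, which function calling sends to the model; JSON mode sends only a request for syntactically valid JSON and leaves validation to the client. A sketch with a hypothetical `Invoice` model (Pydantic v2):

```python
from pydantic import BaseModel, Field

class Invoice(BaseModel):
    vendor: str
    total_cents: int = Field(ge=0, description="Total in cents")

# Function calling: the full schema, constraints included, is sent to the model.
schema = Invoice.model_json_schema()
print(schema["properties"]["total_cents"])  # carries the minimum-0 constraint

# JSON mode: no schema is transmitted; you only request valid JSON, e.g.
#   response_format={"type": "json_object"}
# and describe the desired shape in the prompt, validating client-side after.
```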
EMERGING USE CASES AND ARCHITECTURAL SHIFTS
Instructor's capabilities extend to extracting complex graphs, defining nodes and edges between entities, and understanding nuanced user queries. Liu emphasizes that embeddings alone are often insufficient for complex queries. Instructor helps resolve these into structured requests, enabling more sophisticated data manipulation and interpretation, moving beyond simple retrieval to actionable insights and complex data processing.
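A graph-shaped response model of the kind described can be expressed as nested Pydantic types. This is a hypothetical sketch (the field names are assumptions, not the episode's code); validating a sample payload shows the structure such a model would enforce on the LLM's output:

```python
from pydantic import BaseModel

class Node(BaseModel):
    id: int
    label: str

class Edge(BaseModel):
    source: int
    target: int
    relation: str

class KnowledgeGraph(BaseModel):
    nodes: list[Node]
    edges: list[Edge]

# Passed as a response_model, this asks the LLM for a whole graph in one call.
# Validating a sample payload demonstrates the nested structure:
graph = KnowledgeGraph(
    nodes=[{"id": 1, "label": "Jason"}, {"id": 2, "label": "Instructor"}],
    edges=[{"source": 1, "target": 2, "relation": "created"}],
)
print(len(graph.nodes), graph.edges[0].relation)
```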
THE EVOLUTION OF AI ENGINEERING TALENT
Liu argues that the demand for AI capabilities often outstrips the supply of traditional ML engineers. He advocates for recognizing and empowering motivated software engineers to transition into AI engineering roles. The focus is shifting from deep ML expertise to skills like prompt engineering and working with tools like Instructor, enabling rapid development and problem-solving in the AI space.
CONSULTING OVER VENTURE BACKING
Distinct from many in the AI startup scene, Liu intentionally pursues a consulting path with Instructor rather than seeking venture capital. He finds more fulfillment in tackling interesting, diverse problems through consulting, such as building AI for insurance or M&A reporting. This approach allows for a sustainable business model focused on delivering value, rather than the immense pressure of scaling to a billion-dollar valuation.
AGENCY, PROCESS, AND MEASURING SUCCESS
The discussion touches on 'high agency,' defined as the courage to act despite fear and to focus on process over outcome metrics. Liu uses pottery and software development as examples, where focusing on the amount of clay used or the number of commits, respectively, leads to skill development. This contrasts with outcome-based metrics, which can be gamed or depend on factors outside one's control.
THE FUTURE OF WORKFLOWS AND PROMPTS AS CODE
Liu envisions a future where LLM interactions are managed through defined workflows and DAGs (Directed Acyclic Graphs), moving away from continuous looping. He reiterates the importance of prompts being treated as code, emphasizing that Instructor separates instructions, data, and output types. This structured approach allows for better control, maintainability, and adaptation of AI systems as business needs evolve.
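One way to read "prompts as code" is that the instruction, the data, and the output type are separate, independently versionable pieces assembled per DAG step. A hypothetical sketch (all names are assumptions, not the episode's code):

```python
from pydantic import BaseModel

# Output type: what the step must produce.
class Summary(BaseModel):
    title: str
    bullet_points: list[str]

# Instruction: versioned alongside the code, not buried in a string template.
INSTRUCTIONS = "Summarize the document as a title plus bullet points."

def build_request(document: str) -> dict:
    """Assemble one DAG step: messages plus the expected response type."""
    return {
        "messages": [
            {"role": "system", "content": INSTRUCTIONS},
            {"role": "user", "content": document},  # data arrives at runtime
        ],
        "response_model": Summary,
    }

req = build_request("Instructor wraps the OpenAI SDK to return typed objects.")
print(req["response_model"].__name__)
```

Because the three parts are decoupled, a business change to the instructions or the output schema is an ordinary code diff, reviewable and testable like any other.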
Common Questions
Jason Liu was initially very skeptical of language models, preferring traditional methods like matrix factorization and classical models for tasks like recommendations and image classification. He dismissed them until the release of ChatGPT, which prompted him to acknowledge their incredible potential.
Mentioned in this video
The SDK that Instructor wraps, aiming to handle response models and simplify LLM interactions.
A recommendation framework developed by Jason Liu at Stitch Fix, achieving high adoption and processing millions of requests daily.
A tool mentioned that, along with Guardrails, used XML and Pydantic for structured data extraction from prompts, built with instruct models in mind.
A Python SDK designed to simplify LLM interactions by providing structured data outputs, inspired by the 'requests' library.
An LLM framework mentioned in the context of interacting with Instructor and the broader LLM ecosystem.
A framework presented as the antithesis to 'Show me the prompt', discussed in relation to short-term metrics and prompt management.
Used in conjunction with GPT-3 embeddings and Fe for a similarity search system at Stitch Fix.
A tool mentioned that, along with Marvin, used XML and Pydantic for structured data extraction from prompts, built with instruct models in mind.
A Python library used in Instructor to define schemas and map them, enabling complex data structures and validation.
A Python HTTP library that serves as the philosophical inspiration for Instructor, aiming for similar ease of use and ubiquity.
A workflow management tool that Jason Liu uses and discusses in the context of AI DAGs and modular components.
A company Jason Liu applied to in January 2023, where he was rejected due to lack of LLM experience.
Used by Jason Liu for observability and storing prompt/response data, rather than relying on specialized observability startups.
A notable workflow tool mentioned by a host.
An early language model that Jason Liu was skeptical of, whose capabilities he later acknowledged after ChatGPT's release.
An Anthropic model praised for its cost-effectiveness and function calling abilities.
An LLM framework mentioned in the context of market share alongside LangChain.
The release of ChatGPT prompted Jason Liu to write an apology letter for his prior skepticism towards language models.
An Anthropic model mentioned as having better function calling capabilities and breaking cost-performance trends.
A company whose models (Sonnet, Haiku) are praised for cost-effectiveness and function calling capabilities, though with minor parsing issues in production.
Mentioned as an example of a large company that provides fully managed solutions, contrasting with the trend of developers wanting more control.
An observability tool used for latency tracking, distinct from LLM-specific observability.
A workflow automation tool mentioned for its extensive connectors and relevance to agent workflows.
A platform Jason Liu happily uses, aligning with his preference for developer-centric tools.
Competitor to Prefect, mentioned by a host.
Mentioned as the current employer of C Chan, who previously worked at Tesla with Karpathy. Also, the company behind GPT-3 and ChatGPT, whose release prompted Jason Liu to apologize for his prior dismissal of LLMs.
Mentioned as an organization whose member, John, is working on inversion models.
The New York location of Jason Liu's sabbatical after a hand injury, during which he explored pottery, Jiu-Jitsu, and LLMs.
Mentioned as a source of the common advice for startups to have co-founders, which Jason Liu suggests is not always necessary.