LLaMA 2

Software / App

An open-weights model from Meta that DeepSeek paid attention to, influencing their approach.

Mentioned in 25 videos

Videos Mentioning LLaMA 2

All-In Summit: Bill Gurley presents 2,851 Miles

All-In Podcast

An AI model developed by Meta, seen as a significant development and potential threat to established tech companies.

Training Llama 2, 3 & 4: The Path to Open Source AGI — with Thomas Scialom of Meta AI

Latent Space

Meta AI's second-generation large language model, a priority project that focused on instruction following and chat capabilities.

[LLM Paper Club] Llama 3.1 Paper: The Llama Family of Models

Latent Space

Previous version of Meta's large language model, noted for intentionally not focusing on code generation initially.

E169: Elon sues OpenAI, Apple's decline, TikTok ban, Bitcoin $100K?, Science corner: Microplastics

All-In Podcast

An open-source large language model, specifically the 70B parameter version, supported by Groq's platform.

E143: Nvidia smashes earnings, Arm walks the plank, M&A market, Vivek dominates GOP debate & more

All-In Podcast

An open-source large language model developed by Meta, mentioned as an example of big companies making advanced models freely available, which is good for startups.

This World Does Not Exist — Joscha Bach, Karan Malhotra, Rob Haisfield (WorldSim, WebSim, Liquid AI)

Latent Space

A large language model, whose C implementation by Andrej Karpathy was translated by Devin AI.

Breaking down the OG GPT Paper by Alec Radford

Latent Space

Mentioned as an example of a large language model that can be continuously pre-trained on domain-specific data.

Jared Kushner: Israel-Hamas War, paths forward, macro picture, AI

All-In Podcast

An open-source language model, used as a benchmark for Kai-Fu Lee's model and Grok.

E152: Real estate chaos, WeWork bankruptcy, Biden regulates AI, Ukraine's “Cronkite Moment” & more

All-In Podcast

An open-source AI model from Meta, mentioned with limitations regarding user thresholds before requiring Facebook's involvement.

[Paper Club] Upcycling Large Language Models into Mixture of Experts

Latent Space

Mentioned in the context of fine-tuning and comparing cosine similarity with upcycled models.

Personal AI Meetup - Bee, BasedHardware, LangChain LangFriend, Deepgram EmilyAI

Latent Space

An LLM that some customers run on their own infrastructure to minimize network latency.

All-In Summit: In conversation with Vinod Khosla

All-In Podcast

A large language model from Meta that the company is reportedly trying to enhance.

Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI

Latent Space

The second generation of Meta's open-source LLaMA models, with which Soumith Chintala was more closely involved.

A Brief History of the Open Source AI Hacker - with Ben Firshman of Replicate

Latent Space

A large language model released by Meta, which significantly drove growth for Replicate due to its open nature and trainability.

The State of AI in production — with David Hsu of Retool

Latent Space

Mentioned as an open-source model that currently lags behind GPT-4 in performance, leading customers to prefer hosted models.

The Four Wars of the AI Stack - Dec 2023 Recap

Latent Space

A model that people are focusing on fine-tuning.

The Origin and Future of RLHF: the secret ingredient for ChatGPT - with Nathan Lambert

Latent Space

Meta's language model, whose paper reported the effectiveness of RLHF, including its cost and time efficiency, and noted NLP researchers' surprise at its utility. Its RLHF process used rejection sampling.

Beating GPT-4 with Open Source Models - with Michael Royzen of Phind

Latent Space

The second generation of Meta AI's LLaMA large language models, serving as the foundation for Phind's own fine-tuned models.

Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue

Latent Space

A large language model from Meta AI, used as an example for the cost of data labeling versus compute.

The End of Finetuning — with Jeremy Howard of Fast.ai

Latent Space

The base model for Code Llama, which was fine-tuned by Meta.

RAG is a hack - with Jerry Liu of LlamaIndex

Latent Space

A popular open-source model that LlamaIndex integrates with, allowing for self-hosting deployments.

FlashAttention-2: Making Transformers 800% faster AND exact

Latent Space

Meta's latest large language model, released with less restrictive licensing, promoting wider business use and fine-tuning.

"OpenAI is Not God” - The DeepSeek Documentary on Liang Wenfeng, R1 and What's Next

AI Explained

An open-weights model from Meta that DeepSeek paid attention to, influencing their approach.

Marc Benioff | All-In Summit 2024

All-In Podcast

An open-source large language model developed by Meta AI, mentioned as a potential foundation model for custom AI development.
