Transformers

Software / App

A Hugging Face library that uses TensorFlow and PyTorch under the hood, providing access to various AI models.

Mentioned in 15 videos

Videos Mentioning Transformers

Beating Google at Search with Neural PageRank and $5M of H200s — with Will Bryk of Exa.ai

Beating Google at Search with Neural PageRank and $5M of H200s — with Will Bryk of Exa.ai

Latent Space

A neural network architecture that inspired Exa's link prediction model, involving predicting the next link in a similar way to predicting the next token.

Taking Responsibility for Your Life, Why Creators Need to Smash Limits, and Dealmaking Strategies

Taking Responsibility for Your Life, Why Creators Need to Smash Limits, and Dealmaking Strategies

Tim Ferriss

Major toy and media franchise, mentioned as a competitor that McFarlane Toys surpassed in sales records.

A Comprehensive Overview of Large Language Models - Latent Space Paper Club

A Comprehensive Overview of Large Language Models - Latent Space Paper Club

Latent Space

A neural network architecture that relies heavily on self-attention mechanisms, revolutionizing NLP.

Why Google failed to make GPT-3 -- with David Luan of Adept

Why Google failed to make GPT-3 -- with David Luan of Adept

Latent Space

A neural network architecture that revolutionized NLP and is fundamental to modern LLMs and AI agent development.

Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI

Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI

Latent Space

A neural network architecture, with its ability to handle numbers debated due to tokenizer issues.

Jay McClelland: Neural Networks and the Emergence of Cognition | Lex Fridman Podcast #222

Jay McClelland: Neural Networks and the Emergence of Cognition | Lex Fridman Podcast #222

Lex Fridman

A class of neural network architectures, an idea that Jeff Hinton essentially introduced in one of his 1981 papers, highlighting his foresight in the field.

E129: Sam Altman plays chess with regulators, AI's "nuclear" potential, big pharma bundling & more

E129: Sam Altman plays chess with regulators, AI's "nuclear" potential, big pharma bundling & more

All-In Podcast

A type of neural network architecture discussed in relation to AI model functionality and understanding.

E122: Is AI the next great computing platform? ChatGPT vs. Google, containing AGI & RESTRICT Act

E122: Is AI the next great computing platform? ChatGPT vs. Google, containing AGI & RESTRICT Act

All-In Podcast

A foundational research paper by Google in 2017 that OpenAI leveraged for commercialization.

Ep 18: Petaflops to the People — with George Hotz of tinycorp

Ep 18: Petaflops to the People — with George Hotz of tinycorp

Latent Space

Transformers are discussed for their reliance on semi-weight sharing and dynamic weight generation, rather than just 'attention'.

Building Dota Bots That Beat Pros - OpenAI's Greg Brockman, Szymon Sidor, and Sam Altman

Building Dota Bots That Beat Pros - OpenAI's Greg Brockman, Szymon Sidor, and Sam Altman

Y Combinator

AI models that learn to predict the next character in a sequence, potentially learning complex tasks like sentiment analysis.

Efficient Computing for Deep Learning, Robotics, and AI (Vivienne Sze) | MIT Deep Learning Series

Efficient Computing for Deep Learning, Robotics, and AI (Vivienne Sze) | MIT Deep Learning Series

Lex Fridman

A recent and popular type of neural network architecture often involving attention mechanisms and matrix multiplication.

Will Sasso: Comedy, MADtv, AI, Friendship, Madness, and Pro Wrestling | Lex Fridman Podcast #323

Will Sasso: Comedy, MADtv, AI, Friendship, Madness, and Pro Wrestling | Lex Fridman Podcast #323

Lex Fridman

The type of neural network mechanisms that enabled breakthroughs in AI art generation by capturing deep language representations.

Python for AI #4: Model Hubs & HuggingFace Tutorial

Python for AI #4: Model Hubs & HuggingFace Tutorial

AssemblyAI

A Hugging Face library that uses TensorFlow and PyTorch under the hood, providing access to various AI models.

The 7 Most Powerful Moats For AI Startups

The 7 Most Powerful Moats For AI Startups

Y Combinator

Demis Hassabis: DeepMind - AI, Superintelligence & the Future of Humanity | Lex Fridman Podcast #299

Demis Hassabis: DeepMind - AI, Superintelligence & the Future of Humanity | Lex Fridman Podcast #299

Lex Fridman

Models that represent huge leaps in AI evolution since 2010, contributing to the success of large language models.