transformer network
Concept
The architecture underlying both large language models and the embedding networks used with them; it processes tokens through layers of attention.
Mentioned in 2 videos
Videos Mentioning transformer network

Max Tegmark: The Case for Halting AI Development | Lex Fridman Podcast #371
Lex Fridman
A simple computational architecture used in large language models such as GPT-4. It has proven surprisingly effective for advanced AI despite its 'feed forward' design.

Vector Search with LLMs- Computerphile
Computerphile
The architecture underlying both large language models and the embedding networks used with them; it processes tokens through layers of attention.