Universal Transformer
Software / AppMentioned in 1 video
An early architecture exploring adaptive computation depth, with ideas potentially relevant to future LLM architectures.
An early architecture exploring adaptive computation depth, with ideas potentially relevant to future LLM architectures.