Longformer

Software / App

A transformer-based language model designed to handle much longer sequences than standard BERT, which Michael Royzen utilized for its larger context window.

Mentioned in 1 video