NeMo Tron 3

Software / App

A model that uses Mamba 2 as a lightweight layer, alternating it with softmax attention to manage inference cost and expressiveness.

Mentioned in 1 video