NeMo Tron 3
Software / App
A model that uses Mamba 2 as a lightweight layer, alternating it with softmax attention to manage inference cost and expressiveness.
Mentioned in 1 video
A model that uses Mamba 2 as a lightweight layer, alternating it with softmax attention to manage inference cost and expressiveness.