Gemma 9B

Software / App

A smaller model used in experiments to demonstrate that generating more data from it and distilling to a larger model can be more effective than using the larger model's own data.

Mentioned in 1 video