A leaderboard for evaluating models using human evaluation, where LLaMA 3 1B matches LLaMA 2 13B.
Latent Space