An academic benchmark used for evaluating AI models, particularly in multi-turn scenarios.
Latent Space