MT-Bench

Software / App

Mentioned in 2 videos

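A static benchmark developed by LMSYS, inspired by Chatbot Arena, for evaluating LLMs on multi-turn conversations. It consists of 80 two-turn questions across eight categories, with answers typically scored from 1 to 10 by a strong LLM judge such as GPT-4.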