DoSent

Software / App

A tool that uses language models to inspect agent traces and detect problems, offering a qualitative approach to benchmarking.

Mentioned in 1 video