Code Marena
Software / App
A code verification benchmark that is Lean-friendly, where Axiom Math's system achieved 99% accuracy in code with proof generation, significantly outperforming other LLMs.
Mentioned in 1 video
A code verification benchmark that is Lean-friendly, where Axiom Math's system achieved 99% accuracy in code with proof generation, significantly outperforming other LLMs.