RL post training

Concept

A framework Terminal Bench is evolving into, allowing for post-training models using reinforcement learning.

Mentioned in 1 video