RL post-training

Concept (mentioned in 1 video)

A framework that Terminal Bench is evolving into, which allows models to be post-trained using reinforcement learning (RL).