Qwen 1.7B

Software / App

A base model used in RLP experiments, showing significant improvements after RLP training.

Mentioned in 1 video