Q_RWKV_6

Software / App

A 32-billion-parameter preview model released by RWKV, created by converting a Qwen 32B instruction model: its attention layers are replaced with RWKV linear attention layers.