Q_RWKV_6
Software / AppMentioned in 1 video
A 32 billion parameter preview model released by RWKV, created by converting a Qu_32B instruction model by replacing its attention layer with RWKV linear layers.
A 32 billion parameter preview model released by RWKV, created by converting a Qu_32B instruction model by replacing its attention layer with RWKV linear layers.