RWKV: an #RNN-based LLM. Feels like the MLP-Mixer moment for LLMs, combining the best of RNNs (longer sequences) and #Transformers (parallelization). The Raven chatbot is impressive and on par with recent models like GPT4All. https://github.com/BlinkDL/RWKV-LM
RWKV: RNN-based LLM combining RNN and Transformer capabilities