---
license: apache-2.0
---

Original model: [xiaol/RWKV-v5-12B-one-state-chat-16k](https://huggingface.co/xiaol/RWKV-v5-12B-one-state-chat-16k)

You can run this model with [ai00_rwkv_server](https://github.com/cgisky1980/ai00_rwkv_server).

Although ai00_rwkv_server is mainly aimed at low-end PCs, it also runs on servers that support Vulkan.

To try it in Colab, first install [libnvidia-gl-*](https://packages.ubuntu.com/search?keywords=libnvidia-gl&searchon=names&suite=jammy&section=all):

```python
!apt -y install libnvidia-gl-535
```

----

# Original model card:

# Release date: December 18th

Finetuned from the state-of-the-art (SOTA) model, RWKV v5 12B one state base! More details will be provided soon.

Enjoy the incredible performance of this model, which is optimized for systems with 24GB of VRAM and supports fp16. It can be fine-tuned on a single A100 GPU.

To run this model, use the [RWKV Runner](https://github.com/josStorer/RWKV-Runner) tool.

# Finetuned from [Mobius 12B base](https://huggingface.co/xiaol/Mobius-12B-base)

# Usage

- [RWKV next web](https://rwkv-next-web.ai-creator.net/)
- If you use [RWKV runner](https://github.com/josStorer/RWKV-Runner) or [ai00 server](https://github.com/cgisky1980/ai00_rwkv_server), replace the default vocab (tokenizer) with [this one](https://huggingface.co/xiaol/RWKV-v5-12B-one-state-chat-16k/blob/main/rwkv_vocab_v20230424.txt).

# Important Notes

The model overfits certain instructions and is weaker on others, so you need to use completion-style prompts or simulated dialogues.

- **completion prompt** = `'User: make this content longer:\nxxxxxx\n\nAssistant: ok, longer content is'`

# Data format

`User:xxxx\n\nAssistant:xxx\n\nUser:xxxx\n\nAssistant:xxx\n\n`

For the best performance, use this format together with these [vocabs](https://huggingface.co/xiaol/RWKV-v5-12B-one-state-chat-16k/blob/main/rwkv_vocab_v20230424_train.txt).
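
The completion prompt and data format above can be assembled with a small helper. The following is a minimal sketch, not part of the original model card: `build_prompt`, its parameters, and the blank-line cleanup are hypothetical names and choices made for illustration. The `sep` parameter exists because the completion example puts a space after the role label's colon while the data format line does not, so which one works better for you is left as an assumption to verify.

```python
import re


def _clean(text):
    # Assumption: "\n\n" separates turns in the data format, so blank lines
    # inside a single message are collapsed to one newline.
    return re.sub(r"\n{2,}", "\n", text.strip())


def build_prompt(turns, user_msg, assistant_prefix="", sep=" "):
    """Assemble a prompt in the User/Assistant layout from the model card.

    turns: list of (user_text, assistant_text) pairs already exchanged.
    user_msg: the new user message.
    assistant_prefix: optional start of the reply (completion-style prompting).
    sep: text placed after "User:" / "Assistant:" (hypothetical knob).
    """
    parts = []
    for user_text, assistant_text in turns:
        parts.append(f"User:{sep}{_clean(user_text)}\n\n")
        parts.append(f"Assistant:{sep}{_clean(assistant_text)}\n\n")
    parts.append(f"User:{sep}{_clean(user_msg)}\n\n")
    # rstrip() avoids a trailing space when no assistant prefix is given.
    parts.append(f"Assistant:{sep}{assistant_prefix}".rstrip())
    return "".join(parts)


if __name__ == "__main__":
    # Reproduces the completion prompt shown in the notes above.
    prompt = build_prompt(
        [], "make this content longer:\nxxxxxx", "ok, longer content is"
    )
    print(repr(prompt))
```

Pass the resulting string as the raw prompt to whichever runner you use, and let the model continue from the end of the `Assistant:` line.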