Upload README.md
Browse files
README.md
CHANGED
@@ -54,10 +54,12 @@ These files were quantised using hardware kindly provided by [Massed Compute](ht
|
|
54 |
|
55 |
AWQ is an efficient, accurate and blazing-fast low-bit weight quantization method, currently supporting 4-bit quantization. Compared to GPTQ, it offers faster Transformers-based inference with equivalent or better quality compared to the most commonly used GPTQ settings.
|
56 |
|
|
|
|
|
57 |
It is supported by:
|
58 |
|
59 |
- [Text Generation Webui](https://github.com/oobabooga/text-generation-webui) - using Loader: AutoAWQ
|
60 |
-
- [vLLM](https://github.com/vllm-project/vllm) -
|
61 |
- [Hugging Face Text Generation Inference (TGI)](https://github.com/huggingface/text-generation-inference)
|
62 |
- [Transformers](https://huggingface.co/docs/transformers) version 4.35.0 and later, from any code or client that supports Transformers
|
63 |
- [AutoAWQ](https://github.com/casper-hansen/AutoAWQ) - for use from Python code
|
@@ -357,7 +359,7 @@ Donaters will get priority support on any and all AI/LLM/model questions and req
|
|
357 |
|
358 |
**Special thanks to**: Aemon Algiz.
|
359 |
|
360 |
-
**Patreon special mentions**:
|
361 |
|
362 |
|
363 |
Thank you to all my generous patrons and donaters!
|
@@ -368,13 +370,18 @@ And thank you again to a16z for their generous grant.
|
|
368 |
|
369 |
# Original model card: Carsten Kragelund's Chronomaid Storytelling 13B
|
370 |
|
|
|
371 |
# Chronomaid-Storytelling-13b
|
372 |
|
|
|
|
|
373 |
Merge including [Noromaid-13b-v0.1.1](https://huggingface.co/NeverSleep/Noromaid-13b-v0.1.1), and [Chronos-13b-v2](https://huggingface.co/elinas/chronos-13b-v2) with the [Storytelling-v1-Lora](https://huggingface.co/Undi95/Storytelling-v1-13B-lora) applied afterwards
|
374 |
|
|
|
|
|
375 |
## Prompt Format
|
376 |
|
377 |
-
Tested with Alpaca, the Noromaid preset's will probably also work
|
378 |
```
|
379 |
Below is an instruction that describes a task. Write a response that appropriately completes the request.
|
380 |
|
@@ -384,4 +391,17 @@ Below is an instruction that describes a task. Write a response that appropriate
|
|
384 |
### Response:
|
385 |
```
|
386 |
|
387 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
54 |
|
55 |
AWQ is an efficient, accurate and blazing-fast low-bit weight quantization method, currently supporting 4-bit quantization. Compared to GPTQ, it offers faster Transformers-based inference with equivalent or better quality compared to the most commonly used GPTQ settings.
|
56 |
|
57 |
+
AWQ models are currently supported on Linux and Windows, with NVidia GPUs only. macOS users: please use GGUF models instead.
|
58 |
+
|
59 |
It is supported by:
|
60 |
|
61 |
- [Text Generation Webui](https://github.com/oobabooga/text-generation-webui) - using Loader: AutoAWQ
|
62 |
+
- [vLLM](https://github.com/vllm-project/vllm) - version 0.2.2 or later for support for all model types.
|
63 |
- [Hugging Face Text Generation Inference (TGI)](https://github.com/huggingface/text-generation-inference)
|
64 |
- [Transformers](https://huggingface.co/docs/transformers) version 4.35.0 and later, from any code or client that supports Transformers
|
65 |
- [AutoAWQ](https://github.com/casper-hansen/AutoAWQ) - for use from Python code
|
|
|
359 |
|
360 |
**Special thanks to**: Aemon Algiz.
|
361 |
|
362 |
+
**Patreon special mentions**: Michael Levine, 阿明, Trailburnt, Nikolai Manek, John Detwiler, Randy H, Will Dee, Sebastain Graf, NimbleBox.ai, Eugene Pentland, Emad Mostaque, Ai Maven, Jim Angel, Jeff Scroggin, Michael Davis, Manuel Alberto Morcote, Stephen Murray, Robert, Justin Joy, Luke @flexchar, Brandon Frisco, Elijah Stavena, S_X, Dan Guido, Undi ., Komninos Chatzipapas, Shadi, theTransient, Lone Striker, Raven Klaugh, jjj, Cap'n Zoog, Michel-Marie MAUDET (LINAGORA), Matthew Berman, David, Fen Risland, Omer Bin Jawed, Luke Pendergrass, Kalila, OG, Erik Bjäreholt, Rooh Singh, Joseph William Delisle, Dan Lewis, TL, John Villwock, AzureBlack, Brad, Pedro Madruga, Caitlyn Gatomon, K, jinyuan sun, Mano Prime, Alex, Jeffrey Morgan, Alicia Loh, Illia Dulskyi, Chadd, transmissions 11, fincy, Rainer Wilmers, ReadyPlayerEmma, knownsqashed, Mandus, biorpg, Deo Leter, Brandon Phillips, SuperWojo, Sean Connelly, Iucharbius, Jack West, Harry Royden McLaughlin, Nicholas, terasurfer, Vitor Caleffi, Duane Dunston, Johann-Peter Hartmann, David Ziegler, Olakabola, Ken Nordquist, Trenton Dambrowitz, Tom X Nguyen, Vadim, Ajan Kanaga, Leonard Tan, Clay Pascal, Alexandros Triantafyllidis, JM33133, Xule, vamX, ya boyyy, subjectnull, Talal Aujan, Alps Aficionado, wassieverse, Ari Malik, James Bentley, Woland, Spencer Kim, Michael Dempsey, Fred von Graf, Elle, zynix, William Richards, Stanislav Ovsiannikov, Edmond Seymore, Jonathan Leane, Martin Kemka, usrbinkat, Enrico Ros
|
363 |
|
364 |
|
365 |
Thank you to all my generous patrons and donaters!
|
|
|
370 |
|
371 |
# Original model card: Carsten Kragelund's Chronomaid Storytelling 13B
|
372 |
|
373 |
+
|
374 |
# Chronomaid-Storytelling-13b
|
375 |
|
376 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/65221315578e7da0d74f73d8/v2fVXhCcOdvOdjTrd9dY0.webp" alt="image of a vibrant and whimsical scene with an anime-style character as the focal point. The character is a young girl with blue eyes and short brown hair, wearing a black and white maid outfit with ruffled apron and a red ribbon at her collar. She is lying amidst a fantastical backdrop filled with an assortment of floating, colorful clocks, gears, and hourglasses. The space around her is filled with sparkling stars, glowing nebulae, and swirling galaxies." height="75%" width="75%" />
|
377 |
+
|
378 |
Merge including [Noromaid-13b-v0.1.1](https://huggingface.co/NeverSleep/Noromaid-13b-v0.1.1), and [Chronos-13b-v2](https://huggingface.co/elinas/chronos-13b-v2) with the [Storytelling-v1-Lora](https://huggingface.co/Undi95/Storytelling-v1-13B-lora) applied afterwards
|
379 |
|
380 |
+
Inteded for primarily RP, and will do ERP, narrator-character and group-chats without much trouble in my testing.
|
381 |
+
|
382 |
## Prompt Format
|
383 |
|
384 |
+
Tested with Alpaca, the Noromaid preset's will probably also work (check the Noromaid model card for SillyTavern presets)
|
385 |
```
|
386 |
Below is an instruction that describes a task. Write a response that appropriately completes the request.
|
387 |
|
|
|
391 |
### Response:
|
392 |
```
|
393 |
|
394 |
+
## Sampler Settings
|
395 |
+
|
396 |
+
Tested at
|
397 |
+
* `temp` 1.3 `min p` 0.05 and 0.15
|
398 |
+
* `temp` 1.7, `min p` 0.08 and 0.15
|
399 |
+
|
400 |
+
## Quantized Models
|
401 |
+
The model has been kindly quantized in GGUF, AWQ, and GPTQ by TheBloke
|
402 |
+
Find them in the [Chronomaid-Storytelling-13b Collection](https://huggingface.co/collections/NyxKrage/chronomaid-storytelling-13b-656115dd7065690d7f17c7c8)
|
403 |
+
|
404 |
+
## Thanks ❤️
|
405 |
+
|
406 |
+
To [Undi](https://huggingface.co/Undi95) & [Ikari](https://huggingface.co/IkariDev) for Noromaid and [Elinas](https://huggingface.co/elinas) for Chronos
|
407 |
+
Support [Undi](https://ko-fi.com/undiai) and [Elinas](https://ko-fi.com/elinas) on Kofi
|