P0x0
/

Astra-v1-12B-GGUF

Text Generation

general-purpose

Inference Endpoints

Model card Files Files and versions Community

P0x0 commited on Sep 23

Commit

b65d69d

•

1 Parent(s): 3cde4d6

Update README.md

Files changed (1) hide show

README.md +2 -13

README.md CHANGED Viewed

@@ -38,16 +38,5 @@ Astra-v1-12B can be used directly for a wide range of NLP tasks, including:
 ### Out-of-Scope Use
 Astra-v1-12B is not intended for real-time decision-making in critical applications or generating harmful or biased content.
-## How to Get Started with the Model
-```python
-from transformers import AutoModelForCausalLM, AutoTokenizer
-tokenizer = AutoTokenizer.from_pretrained("P0x0/astra-v1-12b")
-model = AutoModelForCausalLM.from_pretrained("P0x0/astra-v1-12b")
-input_text = "Explain the theory of relativity in simple terms."
-inputs = tokenizer(input_text, return_tensors="pt")
-outputs = model.generate(**inputs)
-print(tokenizer.decode(outputs[0], skip_special_tokens=True))

 ### Out-of-Scope Use
 Astra-v1-12B is not intended for real-time decision-making in critical applications or generating harmful or biased content.
+## How to Get Started with the quantized model
+To run the quantized version of the model, you can use [KoboldCPP](https://github.com/LostRuins/koboldcpp), which allows you to run quantized GGUF models locally.