Update README.md
README.md
@@ -23,6 +23,12 @@ quantized_by: dfurman
 
 This model is a finetune of `MaziyarPanahi/calme-2.4-rys-78b` on 1.5k rows of the `mlabonne/orpo-dpo-mix-40k` dataset.
 
+It was trained as a generalist language model supporting a variety of text generation use cases, including agentic capabilities, roleplaying, reasoning, multi-turn conversation, long context coherence, and more.
+
+## 🚅 Training
+
+Here are a few visualizations of the finetune run:
+
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62afc20ca5bd7cef3e1ab3f4/NG5WGL0ljzLsNhSBRVqnD.png)
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62afc20ca5bd7cef3e1ab3f4/Zhk5Bpr1I2NrzX98Bhtp8.png)
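The new Training section only shows run visualizations, not the training code. Purely as an illustration of how a preference-tuning run over 1.5k rows of `mlabonne/orpo-dpo-mix-40k` could be set up, here is a minimal sketch using TRL's `ORPOTrainer`; the trainer choice, hyperparameters, and output path are assumptions, not the recipe actually used for this model.

```python
# Minimal sketch only: the card does not publish its training code, so the trainer,
# hyperparameters, and output path below are assumptions for illustration.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

base_id = "MaziyarPanahi/calme-2.4-rys-78b"  # base model named in the card

tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype="auto", device_map="auto")

# ~1.5k rows of the preference dataset named in the card; depending on the TRL
# version, extra columns may need to be dropped or mapped to prompt/chosen/rejected.
dataset = load_dataset("mlabonne/orpo-dpo-mix-40k", split="train").select(range(1500))

# Illustrative hyperparameters; a 78B model would additionally need multi-GPU
# sharding and/or parameter-efficient finetuning in practice.
config = ORPOConfig(
    output_dir="./finetune-out",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    num_train_epochs=1,
    beta=0.1,
)

trainer = ORPOTrainer(
    model=model,
    args=config,
    train_dataset=dataset,
    processing_class=tokenizer,  # `tokenizer=` on older TRL releases
)
trainer.train()
```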
@@ -57,7 +63,7 @@ else:
     attn_implementation = "eager"
     torch_dtype = torch.float16
 
-# quantize if necessary
+# # quantize if necessary
 # bnb_config = BitsAndBytesConfig(
 # load_in_4bit=True,
 # bnb_4bit_quant_type="nf4",
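This hunk only touches fragments of the quickstart cell: the attention/precision fallback under `else:` and the commented-out 4-bit quantization block. Pieced together, a setup along these lines typically looks like the sketch below; the model ID is a placeholder for this repo's Hub ID, the condition on the `if` branch is not shown in the excerpt, and the exact `pipeline` arguments are assumptions rather than the card's verbatim code.

```python
# Sketch of how the fragments above usually fit together; placeholder IDs and
# assumed arguments are marked in comments. This is not the card's verbatim cell.
import torch
import transformers
from transformers import AutoTokenizer, BitsAndBytesConfig

model_id = "<this-model-hub-id>"  # placeholder: replace with this repo's Hub ID

# The excerpt only shows the `else:` branch; a common pattern is to use flash
# attention + bf16 on Ampere-or-newer GPUs and fall back otherwise (assumed here).
if torch.cuda.is_available() and torch.cuda.get_device_capability()[0] >= 8:
    attn_implementation = "flash_attention_2"
    torch_dtype = torch.bfloat16
else:
    attn_implementation = "eager"
    torch_dtype = torch.float16

# Optional 4-bit quantization, mirroring the commented-out block in the card
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch_dtype,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    tokenizer=tokenizer,
    torch_dtype=torch_dtype,
    device_map="auto",
    model_kwargs={
        "attn_implementation": attn_implementation,
        # "quantization_config": bnb_config,  # uncomment to load in 4-bit
    },
)
```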
@@ -84,6 +90,30 @@ pipeline = transformers.pipeline(
 
 ### Example 1
 
+```python
+question = "Is the number 9.11 larger than 9.9?"
+
+messages = [
+    {"role": "system", "content": "You are a helpful assistant that thinks step by step."},
+    {"role": "user", "content": question},
+]
+prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+# print("***Prompt:\n", prompt)
+
+outputs = pipeline(
+    prompt, max_new_tokens=1000, do_sample=True, temperature=0.01, top_k=50, top_p=0.95
+)
+print("***Generation:")
+print(outputs[0]["generated_text"][len(prompt) :])
+```
+
+```
+***Generation:
+To compare these two numbers, it's important to look at their decimal places after the whole number part, which is 9 in both cases. Comparing the tenths place, 9.11 has a '1' and 9.9 has a '9'. Since '9' is greater than '1', 9.9 is larger than 9.11.
+```
+
+### Example 2
+
 ```python
 question = """The bakers at the Beverly Hills Bakery baked 200 loaves of bread on Monday morning.
 They sold 93 loaves in the morning and 39 loaves in the afternoon.
@@ -114,7 +144,7 @@ print(outputs[0]["generated_text"][len(prompt):])
 |3|Adjust for returns|Add returned loaves to remaining|74|
 ```
 
-### Example
+### Example 3
 
 ```python
 question = "What's a good recipe for a spicy margarita?"
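The card's examples are all single-turn, while the description lists multi-turn conversation among the supported use cases. A brief sketch of carrying the same chat-template pattern across turns is shown below; it assumes the `tokenizer` and `pipeline` objects from the setup earlier in the card, and the follow-up question is invented for illustration.

```python
# Hypothetical multi-turn continuation of the card's examples, reusing its
# chat-template + pipeline pattern; the follow-up prompt is made up.
messages = [
    {"role": "system", "content": "You are a helpful assistant that thinks step by step."},
    {"role": "user", "content": "What's a good recipe for a spicy margarita?"},
]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
outputs = pipeline(prompt, max_new_tokens=1000, do_sample=True, temperature=0.01, top_k=50, top_p=0.95)
reply = outputs[0]["generated_text"][len(prompt):]

# Second turn: carry the previous exchange forward before re-applying the template
messages += [
    {"role": "assistant", "content": reply},
    {"role": "user", "content": "Can you make it a mocktail instead?"},
]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
outputs = pipeline(prompt, max_new_tokens=1000, do_sample=True, temperature=0.01, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"][len(prompt):])
```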