mrm8488
/

limstral-7B-v0.1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

mrm8488 commited on Nov 1, 2023

Commit

c0ab2b2

•

1 Parent(s): 30af980

Update README.md

Files changed (1) hide show

README.md +17 -1

README.md CHANGED Viewed

@@ -13,7 +13,23 @@ This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggin
 ## Training procedure
-The model was loaded on **8 bits** and fine-tuned on the LIMA dataset using the **LoRA** PEFT technique with the `huggingface/peft` library for 2 epochs on 1 x A100 (40GB) GPU.
 LoRA config:
 ```
 config = LoraConfig(

 ## Training procedure
+The model was loaded on **8 bits** and fine-tuned on the LIMA dataset using the **LoRA** PEFT technique with the `huggingface/peft` library and `trl/sft` for 2 epochs on 1 x A100 (40GB) GPU.
+SFT Trainer params:
+```
+trainer = SFTTrainer(
+    model=model,
+    train_dataset=train_ds,
+    eval_dataset=test_ds,
+    peft_config=peft_config,
+    dataset_text_field="text",
+    max_seq_length=2048,
+    tokenizer=tokenizer,
+    args=training_arguments,
+    packing=False
+)
+```
 LoRA config:
 ```
 config = LoraConfig(