UsernameJustAnother committed 095a44a (1 parent: 85f843f)
Update README.md
README.md CHANGED
@@ -20,7 +20,14 @@ tags:
 Experimental RP Finetune with secret sauce dataset, rsLoRA, r = 64, on a Colab A100 instance. 30GB vRAM used, 2 epochs ~ 3hrs of training.
 
 ```
-
+==((====))==  Unsloth - 2x faster free finetuning | Num GPUs = 1
+   \\   /|    Num examples = 8,160 | Num Epochs = 2
+O^O/ \_/ \    Batch size per device = 2 | Gradient Accumulation steps = 4
+\        /    Total batch size = 8 | Total steps = 2,040
+ "-____-"     Number of trainable parameters = 228,065,280
+
+
+r = 64,
 target_modules = ["q_proj", "k_proj", "v_proj", "o_proj",
                   "gate_proj", "up_proj", "down_proj",],
 lora_alpha = 64,
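For reference, the `r`, `target_modules`, and `lora_alpha` fragments shown in the diff are keyword arguments to Unsloth's `FastLanguageModel.get_peft_model`. A minimal sketch of the adapter setup is below; the base model name, `max_seq_length`, `load_in_4bit`, dropout, and bias are assumptions, not taken from this commit — only `r = 64`, the seven target modules, `lora_alpha = 64`, and the rsLoRA flag reflect what the README states.

```python
# Minimal sketch, not the author's exact script. model_name, max_seq_length,
# load_in_4bit, lora_dropout, and bias are assumptions; r, target_modules,
# lora_alpha, and use_rslora mirror the values in the README diff above.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "unsloth/Mistral-Nemo-Base-2407-bnb-4bit",  # placeholder; the commit does not name the base model
    max_seq_length = 4096,   # assumed, not in the commit
    load_in_4bit = True,     # assumed
)

model = FastLanguageModel.get_peft_model(
    model,
    r = 64,                  # from the README
    target_modules = ["q_proj", "k_proj", "v_proj", "o_proj",
                      "gate_proj", "up_proj", "down_proj"],
    lora_alpha = 64,         # from the README
    lora_dropout = 0,        # assumed default
    bias = "none",           # assumed default
    use_rslora = True,       # "rsLoRA" per the model description
)
```

With `use_rslora = True` the adapter output is scaled by α/√r = 64/√64 = 8 rather than plain LoRA's α/r = 1, which is the "rank-stabilized" part: the effective scale no longer shrinks as the rank grows. Incidentally, 228,065,280 trainable parameters at r = 64 over these seven projections is consistent with a Mistral-NeMo-12B-class geometry (40 layers, hidden size 5120), though the commit itself does not name the base model.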