Update README.md
README.md
CHANGED
@@ -20,10 +20,6 @@ This model is llama-3-8b-instruct from Meta (uploaded by unsloth) trained on the
 
 The Qalore method uses QLoRA training along with the methods from GaLore for additional reductions in VRAM, allowing llama-3-8b to be loaded in 14.5 GB of VRAM. This allowed training to be completed on an RTX A4000 16GB in 130 hours for less than $20.
 
-Dataset used for training this model:
-
-- https://huggingface.co/datasets/Replete-AI/OpenCodeInterpreterData
-
 Qalore notebook for training:
 
 - https://colab.research.google.com/drive/1bX4BsjLcdNJnoAf7lGXmWOgaY8yekg8p?usp=sharing
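
For readers who want to see what the Qalore combination described above looks like in code, here is a minimal sketch of pairing QLoRA-style 4-bit loading with a GaLore optimizer. It is not the linked notebook: the model id, LoRA target modules, GaLore rank, and all other hyperparameters are illustrative assumptions, and the actual Qalore recipe may differ.

```python
# Minimal sketch: QLoRA (4-bit base model + LoRA adapters) combined with a
# GaLore optimizer from galore_torch. Hyperparameters and module names are
# illustrative assumptions, not the settings used for this model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from galore_torch import GaLoreAdamW8bit

model_id = "unsloth/llama-3-8b-Instruct"  # assumed base upload

# QLoRA part: load the base weights in 4-bit NF4 so they fit on a ~16 GB card.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# Attach small trainable LoRA adapters; only these are updated during training.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# GaLore part: give the 2-D trainable matrices a low-rank gradient projection,
# which cuts optimizer-state memory on top of the QLoRA savings.
galore_params = [p for p in model.parameters() if p.requires_grad and p.dim() == 2]
other_params = [p for p in model.parameters() if p.requires_grad and p.dim() != 2]
optimizer = GaLoreAdamW8bit(
    [
        {"params": other_params},
        {"params": galore_params, "rank": 128, "update_proj_gap": 200,
         "scale": 0.25, "proj_type": "std"},
    ],
    lr=2e-5,
)
# From here the optimizer can be passed to a transformers Trainer via
# `optimizers=(optimizer, None)` or used in a manual training loop.
```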