Distill Llama-3.2-1B-Instruct from Llama-405B-Instruct to make SuperNova-Pico

#14

by Joseph717171 - opened Sep 29

Discussion

Joseph717171

Sep 29

•

edited Sep 29

You guys did amazing with SuperNova-Lite. Can you please make a distillation of Llama-405B-Instruct into Llama-3.2-1B-Instruct to make SuperNova-Pico? Or, perhaps, distill Llama-405B-Instruct into Llama-3.2-3B-Instruct to make SuperNova-Micro, and then prune it down to 1B/1.5B parameters and train-heal it to make SuperNova-Pico?😋

Question: How smart can a 1B parameter SMOL LLM be, and how smart can it become? 🤔😋📱

Crystalcareai

Arcee AI org Oct 2

Working on it :)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment