ValiantLabs
/

Llama3.1-8B-ShiningValiant2

sequelbox commited on 18 days ago

Commit

050b939

•

1 Parent(s): a97a6fb

Upload folder using huggingface_hub (#5)

- adfacd1fdecad3533c359c30035a150da4a6e0744babe33836ac0fdb654fbd47 (55d49b1c98c08068f079996428a06ceda4167015)
- 89d39802f0f44aab3aaec9fe367bbde5bdeb1f600dc741fc19855a66bf25179e (532a50809d5112e12c2e72cb107ca8f7739ea2a6)
- 154b6296b96d19d09f03b3a2c445b70c6254c729fee49893e751a6f8bd5f4923 (1a1feb7c86b5379c2de530a5707d09186273231f)
- 04e119436f75f623c3d09d0eed91cd5bdcec6c8aed307bdb0a418776a3ed37a3 (c8e81dd24aeddc4f4c43253c3bc3243628ee7b6c)
- aad672a5c32ccddaefef79368d8cca41096ad7c03b3d22377804b1dd0eefa803 (f00cba4ff2556bf9ab84460ee7a9ef48aeb70a34)
- 2699bd4029ecae938e19db9fe8e756f6ed9c30214508311656496453decf4e2f (23f25e7d7f13ff4626ebe028c1b6f439442febac)
- c191b597e4415146f4bb0d3df015d2bd960a03ca7ee497671c00fa16ac390ee5 (5ab58aaf5db3792643d86419f42bb2b7c6be7c78)

Files changed (11) hide show

.gitattributes CHANGED Viewed

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+tokenizer.json filter=lfs diff=lfs merge=lfs -text

README.md CHANGED Viewed

@@ -29,6 +29,7 @@ tags:
 base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
 datasets:
 - sequelbox/Celestia
 - sequelbox/Supernova
 model_type: llama
 model-index:
@@ -261,9 +262,9 @@ Shining Valiant 2 is a chat model built on Llama 3.1 8b, finetuned on our data f
 ## Version
-This is the **2024-09-16** release of Shining Valiant 2 for Llama 3.1 8b.
-We've improved and open-sourced our new baseline [science-instruct dataset](https://huggingface.co/datasets/sequelbox/Celestia). This release features improvements in physics, chemistry, biology, and computer science.
 Future upgrades will continue to expand Shining Valiant's technical knowledge base.
@@ -303,9 +304,9 @@ print(outputs[0]["generated_text"][-1])
 ## The Model
 Shining Valiant 2 is built on top of Llama 3.1 8b Instruct.
-The current version of Shining Valiant 2 is trained on technical knowledge using [sequelbox/Celestia](https://huggingface.co/datasets/sequelbox/Celestia) and general chat capability using [sequelbox/Supernova.](https://huggingface.co/datasets/sequelbox/Supernova)
-Our private data adds specialist knowledge and Shining Valiant's personality: she's friendly, enthusiastic, insightful, knowledgeable, and loves to learn! Magical. (As a general note: we're hoping to replace and open-source this part of Shining Valiant's dataset with synthetic data soon!)
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)

 base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
 datasets:
 - sequelbox/Celestia
+- sequelbox/Spurline
 - sequelbox/Supernova
 model_type: llama
 model-index:
 ## Version
+This is the **2024-11-04** release of Shining Valiant 2 for Llama 3.1 8b.
+This release uses our newest datasets, open-sourced for everyone's use, including our expanded [science-instruct dataset](https://huggingface.co/datasets/sequelbox/Celestia). This release features improvements in logical thinking and structured reasoning as well as physics, chemistry, biology, astronomy, Earth science, computer science, and information theory.
 Future upgrades will continue to expand Shining Valiant's technical knowledge base.
 ## The Model
 Shining Valiant 2 is built on top of Llama 3.1 8b Instruct.
+The current version of Shining Valiant 2 is trained on technical knowledge using [sequelbox/Celestia](https://huggingface.co/datasets/sequelbox/Celestia), complex reasoning using [sequelbox/Spurline](https://huggingface.co/datasets/sequelbox/Spurline), and general chat capability using [sequelbox/Supernova.](https://huggingface.co/datasets/sequelbox/Supernova)
+We're super excited that Shining Valiant's dataset has been fully open-sourced! She's friendly, enthusiastic, insightful, knowledgeable, and loves to learn! Magical.
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)

config.json CHANGED Viewed

@@ -11,6 +11,7 @@
     128008,
     128009
   ],
   "hidden_act": "silu",
   "hidden_size": 4096,
   "initializer_range": 0.02,
@@ -33,7 +34,7 @@
   "rope_theta": 500000.0,
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
-  "transformers_version": "4.44.2",
   "use_cache": true,
   "vocab_size": 128256
 }

     128008,
     128009
   ],
+  "head_dim": 128,
   "hidden_act": "silu",
   "hidden_size": 4096,
   "initializer_range": 0.02,
   "rope_theta": 500000.0,
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
+  "transformers_version": "4.46.1",
   "use_cache": true,
   "vocab_size": 128256
 }

generation_config.json CHANGED Viewed

@@ -8,5 +8,5 @@
   ],
   "temperature": 0.6,
   "top_p": 0.9,
-  "transformers_version": "4.44.2"
 }

   ],
   "temperature": 0.6,
   "top_p": 0.9,
+  "transformers_version": "4.46.1"
 }

model-00001-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dcebe7b4eacb57cbc4e03e60f0d4e1eec8a1471455a3fdbc953edfaca5c8763e
 size 4886466168

 version https://git-lfs.github.com/spec/v1
+oid sha256:6efbffa72857ec90e0ea4310a6025190a4e75eef43e10ec9d46025412e1616a8
 size 4886466168

model-00002-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:756b38e9412a00dc12d14823d48c9a71732a1c0318fd9bb48661e9589ddb9ac1
 size 4832007448

 version https://git-lfs.github.com/spec/v1
+oid sha256:c569b9d9276836eb9f31fda31ea667ee3ad1c132b852ec94b4b9a7a2598db0ca
 size 4832007448

model-00003-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4d3ff8801d13032241f11b23af8bf458181a87b41b3e6497cf7cc503a0469ce6
 size 4999813112

 version https://git-lfs.github.com/spec/v1
+oid sha256:10413c97beeea538cb108448193c790d5224192982c2837b1dc3a54a1d5ff50b
 size 4999813112

model-00004-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:35ee4a044f0e1c92ba26c63b584ac344740d70fff1f3d86d073810bc8e610d66
 size 4999813128

 version https://git-lfs.github.com/spec/v1
+oid sha256:6ef021115a20513e5b0db4178345a1f4959c59eb73fbb3679aca24055ead5d0e
 size 4999813128

model-00005-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7b6123fecf735935528930e989780254f5bd5eb78b872cda5677f04479d09c25
 size 4832007496

 version https://git-lfs.github.com/spec/v1
+oid sha256:26822b4a9c2cc0f9d92e0c1522f517aac4a20a6b936c706e1ca68ed1beaf8b44
 size 4832007496

model-00006-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:895b3445cc9cb423b5c8b67c289eecd411f860ea3d7255857beb8fcb8e990621
 size 4999813120

 version https://git-lfs.github.com/spec/v1
+oid sha256:8f64f7cdbfd3903f7fea88117c49a8533a1ffa928d1ce4354d0d8431faddffe4
 size 4999813120

tokenizer.json CHANGED Viewed

The diff for this file is too large to render. See raw diff