Pi3141
/

alpaca-lora-30B-ggml

Model card Files Files and versions Community

llama.cpp: llama_model_load: llama_model_load: tensor 'tok_embeddings.weight' has wrong size in model file main

by c2d03041 - opened Mar 21, 2023

Mar 21, 2023

The fix is this:

diff --git a/main.cpp b/main.cpp
index 3321818..3fa487b 100644
--- a/main.cpp
+++ b/main.cpp
@@ -34,8 +34,8 @@ static const int EOS_TOKEN_ID = 2;
 // determine number of model parts based on the dimension
 static const std::map<int, int> LLAMA_N_PARTS = {
     { 4096, 1 },
-    { 5120, 2 },
-    { 6656, 4 },
+    { 5120, 1 },
+    { 6656, 1 },
     { 8192, 8 },
 };
 
@@ -43,10 +43,10 @@ static const std::map<int, int> LLAMA_N_PARTS = {
 struct llama_hparams {
     int32_t n_vocab = 32000;
     int32_t n_ctx   = 512;   // this is provided as user input?
-    int32_t n_embd  = 4096;
+    int32_t n_embd  = 5120;
     int32_t n_mult  = 256;
-    int32_t n_head  = 32;
-    int32_t n_layer = 32;
+    int32_t n_head  = 40;
+    int32_t n_layer = 40;
     int32_t n_rot   = 64;
     int32_t f16     = 1;
 };

c2d03041 changed discussion title from lama.cpp: llama_model_load: llama_model_load: tensor 'tok_embeddings.weight' has wrong size in model file main to llama.cpp: llama_model_load: llama_model_load: tensor 'tok_embeddings.weight' has wrong size in model file main Mar 21, 2023

mdimai666

Mar 21, 2023

its work for me) thanks!

Pi3141

Owner Mar 21, 2023

Glad you got it to work. However, just one little comment. It's not anything major that will break stuff.

llama_hparams doesn't need to be changed since that's the default Paramus for 7B. If you want to change it, here are the correct values for 30B:
int32_t n_embd: 6656
int32_t n_head: 52
int32_t n_layer: 60

Agcuevas

Mar 22, 2023

how can I recompile the main.cpp once i make the corrections? The "npx dalai alpaca install" command takes the old main.cpp from a repository.

Pi3141

Owner Mar 22, 2023

how can I recompile the main.cpp once i make the corrections? The "npx dalai alpaca install" command takes the old main.cpp from a repository.

Modify alpaca.js in Dalai and change https://github.com/candywrap/alpaca.cpp.git to https://github.com/ItsPi3141/alpaca.cpp.git

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment