TheDrummer committed
Commit 6d67d36 • 1 Parent(s): 488e7d0
Update README.md
README.md CHANGED
@@ -126,4 +126,5 @@ Given how the duplicated layers seem to have a stabilizing effect, it begs the q
 
 ### Can you replicate this effect on normal models by freezing layers?
 
-### We've so far hypothesized that training 'slowly fills' the duplicated layers. If we intentionally undercook, will the duplicated layers look *underfilled* or can you fill it up with a few steps? In other words,
+### We've so far hypothesized that training 'slowly fills' the duplicated layers. If we intentionally undercook, will the duplicated layers look *underfilled* or can you fill it up with a few steps? In other words, can a single/few updates to the model reconnect the duplicated layers?
+- Are we really repairing the 'neurons' step-by-step, or have they been significantly rearranged by the first (few?) steps?
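
As a rough illustration of the "freezing layers" question above, here is a minimal PyTorch sketch of freezing a contiguous block of decoder layers in a normal model before fine-tuning, so only the remaining layers receive gradient updates. The model name and layer indices are placeholders for illustration, not taken from this repo.

```python
# Hypothetical sketch: freeze a block of decoder layers in a normal model
# before fine-tuning, to see whether it mimics the stabilizing effect that
# duplicated layers appear to have. Model name and layer range are placeholders.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",   # placeholder model
    torch_dtype=torch.bfloat16,
)

FROZEN = range(16, 24)  # hypothetical block of layers to freeze

for idx, layer in enumerate(model.model.layers):
    if idx in FROZEN:
        for p in layer.parameters():
            p.requires_grad = False

# Only the still-trainable parameters are handed to the optimizer.
trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.AdamW(trainable, lr=1e-5)
```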
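For the "filled slowly vs. rearranged in the first few steps" question, one way to probe it is to snapshot the duplicated layers before training and compare them to the same layers after a handful of updates. A minimal sketch, assuming two checkpoints and known (here hypothetical) duplicated-layer indices:

```python
# Hypothetical probe: compare a duplicated layer's weights before training and
# after a few optimizer steps. Checkpoint paths and layer indices are
# assumptions for illustration only.
import torch
from transformers import AutoModelForCausalLM

before = AutoModelForCausalLM.from_pretrained("path/to/merged-untrained")
after = AutoModelForCausalLM.from_pretrained("path/to/merged-few-steps")

DUPLICATED = [20, 21, 22, 23]  # hypothetical indices of the duplicated layers

for idx in DUPLICATED:
    p_before = torch.cat([p.flatten() for p in before.model.layers[idx].parameters()])
    p_after = torch.cat([p.flatten() for p in after.model.layers[idx].parameters()])
    cos = torch.nn.functional.cosine_similarity(p_before, p_after, dim=0)
    rel = (p_after - p_before).norm() / p_before.norm()
    # Cosine near 1 with a tiny relative change would point to gradual 'filling';
    # a large early change would suggest the layers are rearranged almost immediately.
    print(f"layer {idx}: cosine={cos.item():.4f} relative_change={rel.item():.4f}")
```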