fxmarty's picture
Update README.md
f77adfb verified
|
raw
history blame
109 Bytes
---
license: mit
---
This one with a custom `config.head_dim` as allowed by the architecture (see 7b model).