test-adapter / README.md
hitorilabs's picture
Update README.md
6ee341c
|
raw
history blame
308 Bytes
---
license: llama2
---
Trained using TRL, it didn't fit properly on my 3090 without significantly dropping batch size and applying 4-bit quantization.
It didn't exactly converge.
![training_run.png](https://cdn-uploads.huggingface.co/production/uploads/64075c834dc5f2846c96bc98/b-Tn5IDcRubZp_AyfLNg7.png)