H/W resources
#4
by
r2d209
- opened
I want to know H/W resources you used to train this model.
like GPU(a100 or... something else), GPU RAM size
Yes, it was trained on 7 A100 80GB GPUs. But it's a bit of an overkill. It was done mainly cuz I was working with a very large custom dataset.
I have been successful in training the same model also on a T4 GPU using DeepSpeed. And you could use an even smaller GPU if you utilize PEFT