Text Generation
scaling
umup-research-7b-bf16 / model_state_layer_34_TransformerLMHead.pt

Commit History