Text Generation
scaling
umup-research-1b-bf16 / model_state_layer_17_LayerNormWrapper.pt

Commit History