Pretrained (not necessarily LLMs)
Collection
some of my pre-trained models will be present over here.
•
1 item
•
Updated
•
1
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Hyperparameters | Value |
---|---|
name | Adam |
weight_decay | None |
clipnorm | None |
global_clipnorm | None |
clipvalue | None |
use_ema | False |
ema_momentum | 0.99 |
ema_overwrite_frequency | None |
jit_compile | True |
is_legacy_optimizer | False |
learning_rate | 0.0010000000474974513 |
beta_1 | 0.9 |
beta_2 | 0.999 |
epsilon | 1e-07 |
amsgrad | False |
training_precision | float32 |