Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
bertin-project
/
bertin-base-gaussian-exp-512seqlen
like
1
Follow
BERTIN Project
20
Fill-Mask
Transformers
PyTorch
JAX
TensorBoard
Joblib
Spanish
roberta
spanish
Inference Endpoints
License:
cc-by-4.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
bertin-base-gaussian-exp-512seqlen
/
outputs
/
checkpoints
/
checkpoint-48000
3 contributors
History:
1 commit
versae
Step... (49001/50000 | Loss: 1.5274475812911987, Acc: 0.6873284578323364): 98%|ββββββββββββββββββββββββββββββ| 49024/50000 [9:31:53<21:29, 1.32s/it]
3b1060f
over 3 years ago
config.json
Safe
618 Bytes
Step... (49001/50000 | Loss: 1.5274475812911987, Acc: 0.6873284578323364): 98%|ββββββββββββββββββββββββββββββ| 49024/50000 [9:31:53<21:29, 1.32s/it]
over 3 years ago
data_collator.joblib
pickle
Detected Pickle imports (5)
"tokenizers.AddedToken"
,
"__main__.FlaxDataCollatorForLanguageModeling"
,
"tokenizers.models.Model"
,
"transformers.models.roberta.tokenization_roberta_fast.RobertaTokenizerFast"
,
"tokenizers.Tokenizer"
How to fix it?
1.47 MB
LFS
Step... (49001/50000 | Loss: 1.5274475812911987, Acc: 0.6873284578323364): 98%|ββββββββββββββββββββββββββββββ| 49024/50000 [9:31:53<21:29, 1.32s/it]
over 3 years ago
flax_model.msgpack
Safe
250 MB
LFS
Step... (49001/50000 | Loss: 1.5274475812911987, Acc: 0.6873284578323364): 98%|ββββββββββββββββββββββββββββββ| 49024/50000 [9:31:53<21:29, 1.32s/it]
over 3 years ago
optimizer_state.msgpack
Safe
500 MB
LFS
Step... (49001/50000 | Loss: 1.5274475812911987, Acc: 0.6873284578323364): 98%|ββββββββββββββββββββββββββββββ| 49024/50000 [9:31:53<21:29, 1.32s/it]
over 3 years ago
training_args.joblib
pickle
Detected Pickle imports (4)
"torch.device"
,
"transformers.trainer_utils.SchedulerType"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.training_args.TrainingArguments"
How to fix it?
1.87 kB
LFS
Step... (49001/50000 | Loss: 1.5274475812911987, Acc: 0.6873284578323364): 98%|ββββββββββββββββββββββββββββββ| 49024/50000 [9:31:53<21:29, 1.32s/it]
over 3 years ago
training_state.json
Safe
15 Bytes
Step... (49001/50000 | Loss: 1.5274475812911987, Acc: 0.6873284578323364): 98%|ββββββββββββββββββββββββββββββ| 49024/50000 [9:31:53<21:29, 1.32s/it]
over 3 years ago