Commit History

Keep only the best-performing model in main (all models are still available in the develop branch). The best-performing model is clip_spanish_141230_samples, with a loss of 2.231235980987549
864f8c7

edugp committed on

Add model using BERTIN language encoder trained on 146364 examples and obtaining a loss of 2.231235980987549
505012b

edugp committed on

Refactor test_on_image.py
c309418

edugp committed on

Update README and add a training doc
5019883

edugp committed on

Add actual files instead of symlinks
2b6deb0

edugp committed on

Copy latest model to repo root
2bc8c97

edugp committed on

Add model trained on 141230 samples, achieving a validation loss of 2.1384739875793457
e101844

edugp committed on

Add all necessary files to replicate training run
2daf3c7

edugp committed on

Add model trained on 72972 samples
8a1113b

edugp committed on

Add new model trained on the Spanish subset of suitable images from 20% of the WIT dataset, using a 998/1/1 train/valid/test split, with a validation loss of 22.3439
70c70fa

edugp committed on

Add new model trained on the Spanish subset of suitable images from 20% of the WIT dataset, with a validation loss of 2.4001
049fefd

edugp committed on

Use 1 caption per image
3fa7433

edugp committed on

Add README
a618bc2

edugp committed on

Update downloading and training scripts
98c2b8e

edugp committed on

Resolve merge conflict in .gitattributes
a77e1f7

edugp committed on

Add training scripts and initial model trained on 1% of the data
8e2b754

edugp committed on

Initial commit
30ae6fe

system HF staff committed on