SmilingWolf
/

wd-vit-tagger-v3

Model card Files Files and versions Community

SmilingWolf commited on Mar 6

Commit

7ece298

•

1 Parent(s): 296b77d

Update README.md

Files changed (1) hide show

README.md +28 -0

README.md CHANGED Viewed

@@ -2,3 +2,31 @@
 license: apache-2.0
 library_name: timm
 ---

 license: apache-2.0
 library_name: timm
 ---
+# WD ViT Tagger v3
+Supports ratings, characters and general tags.
+Trained using https://github.com/SmilingWolf/JAX-CV.
+TPUs used for training kindly provided by the [TRC program](https://sites.research.google/trc/about/).
+## Dataset
+Last image id: 7220105
+Trained on Danbooru images with IDs modulo 0000-0899.
+Validated on images with IDs modulo 0950-0999.
+Images with less than 10 general tags were filtered out.
+Tags with less than 600 images were filtered out.
+## Validation results
+`P=R: threshold = 0.2547, F1 = 0.4278`
+## What's new
+Model v1.0/Dataset v3:
+More training images, more and up-to-date tags (up to 2024-02-28).
+Now `timm` compatible! Load it up and give it a spin using the canonical one-liner!
+ONNX model is compatible with code developed for the v2 series of models.
+The batch dimension of the ONNX model is not fixed to 1 anymore. Now you can go crazy with batch inference.
+Switched to Macro-F1 to measure model performance since it gives me a better gauge of overall training progress.
+## Final words
+Subject to change and updates.
+Downstream users are encouraged to use tagged releases rather than relying on the head of the repo.