SmilingWolf
commited on
Commit
•
7ece298
1
Parent(s):
296b77d
Update README.md
Browse files
README.md
CHANGED
@@ -2,3 +2,31 @@
|
|
2 |
license: apache-2.0
|
3 |
library_name: timm
|
4 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2 |
license: apache-2.0
|
3 |
library_name: timm
|
4 |
---
|
5 |
+
# WD ViT Tagger v3
|
6 |
+
|
7 |
+
Supports ratings, characters and general tags.
|
8 |
+
|
9 |
+
Trained using https://github.com/SmilingWolf/JAX-CV.
|
10 |
+
TPUs used for training kindly provided by the [TRC program](https://sites.research.google/trc/about/).
|
11 |
+
|
12 |
+
## Dataset
|
13 |
+
Last image id: 7220105
|
14 |
+
Trained on Danbooru images with IDs modulo 0000-0899.
|
15 |
+
Validated on images with IDs modulo 0950-0999.
|
16 |
+
Images with less than 10 general tags were filtered out.
|
17 |
+
Tags with less than 600 images were filtered out.
|
18 |
+
|
19 |
+
## Validation results
|
20 |
+
`P=R: threshold = 0.2547, F1 = 0.4278`
|
21 |
+
|
22 |
+
## What's new
|
23 |
+
Model v1.0/Dataset v3:
|
24 |
+
More training images, more and up-to-date tags (up to 2024-02-28).
|
25 |
+
Now `timm` compatible! Load it up and give it a spin using the canonical one-liner!
|
26 |
+
ONNX model is compatible with code developed for the v2 series of models.
|
27 |
+
The batch dimension of the ONNX model is not fixed to 1 anymore. Now you can go crazy with batch inference.
|
28 |
+
Switched to Macro-F1 to measure model performance since it gives me a better gauge of overall training progress.
|
29 |
+
|
30 |
+
## Final words
|
31 |
+
Subject to change and updates.
|
32 |
+
Downstream users are encouraged to use tagged releases rather than relying on the head of the repo.
|