SmilingWolf committed on
Commit 7ece298
1 Parent(s): 296b77d

Update README.md

Files changed (1)
  1. README.md +28 -0
README.md CHANGED
@@ -2,3 +2,31 @@
2
  license: apache-2.0
3
  library_name: timm
4
  ---
5
+ # WD ViT Tagger v3
6
+
7
+ Supports ratings, characters and general tags.
8
+
9
+ Trained using https://github.com/SmilingWolf/JAX-CV.
10
+ TPUs used for training kindly provided by the [TRC program](https://sites.research.google/trc/about/).
11
+
12
+ ## Dataset
13
+ Last image id: 7220105
14
+ Trained on Danbooru images with IDs modulo 0000-0899.
15
+ Validated on images with IDs modulo 0950-0999.
16
+ Images with fewer than 10 general tags were filtered out.
17
+ Tags with fewer than 600 images were filtered out.
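
The split rule in the Dataset section can be read as bucketing each image by its ID. The sketch below is a minimal illustration under one assumption: "IDs modulo 0000-0899" means `image_id % 1000` falls in that range. The function names `split_for` and `keep_image` are hypothetical and do not come from the JAX-CV repository.

```python
MIN_GENERAL_TAGS = 10  # from the README: images with fewer general tags are dropped


def split_for(image_id: int, modulus: int = 1000) -> str:
    """Return 'train', 'val', or 'unused' for a Danbooru image ID.

    Assumption: the README's ranges refer to image_id % 1000.
    """
    bucket = image_id % modulus
    if bucket <= 899:
        return "train"
    if bucket >= 950:
        return "val"
    # Buckets 900-949 are not mentioned in the README.
    return "unused"


def keep_image(general_tags: list) -> bool:
    """Apply the README's per-image filter on the number of general tags."""
    return len(general_tags) >= MIN_GENERAL_TAGS
```

Under this reading, the last image ID listed above (7220105) has bucket 105 and would land in the training split.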
18
+
19
+ ## Validation results
20
+ `P=R: threshold = 0.2547, F1 = 0.4278`
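
One plausible way to use the reported threshold is to keep every tag whose predicted probability exceeds it. This is a sketch, not the author's inference code: the helper `select_tags`, the strict `>` comparison, and the tag names in the usage note are all assumptions made for illustration.

```python
P_EQ_R_THRESHOLD = 0.2547  # threshold reported in the README, where precision equals recall


def select_tags(probs: dict, threshold: float = P_EQ_R_THRESHOLD) -> list:
    """Return the tags whose probability is above the threshold, most confident first."""
    kept = [(tag, p) for tag, p in probs.items() if p > threshold]
    kept.sort(key=lambda item: item[1], reverse=True)
    return [tag for tag, _ in kept]
```

For example, `select_tags({"1girl": 0.98, "solo": 0.30, "hat": 0.10})` keeps the first two tags and drops the third. Raising the threshold trades recall for precision, and lowering it does the opposite.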
21
+
22
+ ## What's new
23
+ Model v1.0/Dataset v3:
24
+ More training images, more and up-to-date tags (up to 2024-02-28).
25
+ Now `timm` compatible! Load it up and give it a spin using the canonical one-liner!
26
+ ONNX model is compatible with code developed for the v2 series of models.
27
+ The batch dimension of the ONNX model is not fixed to 1 anymore. Now you can go crazy with batch inference.
28
+ Switched to Macro-F1 to measure model performance since it gives me a better gauge of overall training progress.
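
For readers unfamiliar with the metric, Macro-F1 computes F1 separately for each tag and then averages the per-tag scores, so rare tags weigh as much as common ones. The pure-Python sketch below shows the standard definition. It is not taken from the training code, and scoring a tag with no positives and no predictions as 0.0 is an assumed convention.

```python
def macro_f1(y_true: list, y_pred: list) -> float:
    """Average per-tag F1. Rows are images, columns are tags, entries are 0 or 1."""
    n_tags = len(y_true[0])
    scores = []
    for t in range(n_tags):
        pairs = [(row_true[t], row_pred[t]) for row_true, row_pred in zip(y_true, y_pred)]
        tp = sum(1 for yt, yp in pairs if yt == 1 and yp == 1)
        fp = sum(1 for yt, yp in pairs if yt == 0 and yp == 1)
        fn = sum(1 for yt, yp in pairs if yt == 1 and yp == 0)
        denom = 2 * tp + fp + fn
        # Assumed convention: a tag that never occurs and is never predicted scores 0.0.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / n_tags
```

With `y_true = [[1, 0], [1, 1], [0, 1]]` and `y_pred = [[1, 0], [0, 1], [0, 1]]`, the first tag scores 2/3 and the second scores 1, giving a Macro-F1 of 5/6.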
29
+
30
+ ## Final words
31
+ Subject to change and updates.
32
+ Downstream users are encouraged to use tagged releases rather than relying on the head of the repo.