README.md · deepghs/anime_classification at c97c1ae36383ac2897a1ae280f3beb2fea149201

metadata

license: mit
datasets:
  - deepghs/anime_classification
metrics:
  - accuracy
pipeline_tag: image-classification
tags:
  - art

The model used to predict the types of anime images, which includes the following four categories:

3D: Images rendered in 3D, including Mikumikudance, Koikatsu, etc.
Bangumi: Screenshots from anime videos.
Comic: Images of manga that contain a significant amount of text or panel sequences.
Illustration: General anime illustrations.

Model	FLOPs	Accuracy	Confusion Matrix	Description
caformer_s36	22.10G	88.19%	Confusion Matrix	Model: caformer_s36 from timm
caformer_s36_plus	22.10G	93.47%	Confusion Matrix	Model: caformer_s36.sail_in22k_ft_in1k_384 pratrained from timm
mobilenetv3	0.63G	88.96%	Confusion Matrix	Model: mobilenetv3_large_100 from timm
mobilenetv3_plus	0.63G	89.92%	Confusion Matrix	Model: mobilenetv3_large_100 from timm, use SCELoss as loss function
mobilevitv2_150	9.09G	88.21%	Confusion Matrix	Model: mobilevitv2_150 from timm