Update README.md
Browse files
README.md
CHANGED
@@ -28,14 +28,16 @@ pipeline_tag: image-classification
|
|
28 |
The model is used to classify images into one of the 51 North American swallowtail or cattleheart butterfly species. `resnet50` was used for training.
|
29 |
|
30 |
## Intended uses & limitations
|
31 |
-
The model was trained on 8577 insect images spread over 51 species. The model is likely biased toward some species being more
|
32 |
|
33 |
## Training and evaluation data
|
34 |
|
35 |
The images used in training were obtained from GBIF:
|
36 |
GBIF.org (22 June 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.bqg8bw
|
37 |
|
38 |
-
Only the first 400 images of each species (if available) were downloaded.
|
39 |
-
|
40 |
-
The dataset is primarily "in the wild" shots rather than all staged poses, and includes images for which even an expert would not be able to see identifying characteristics (hence the lower overall accuracy).
|
|
|
|
|
41 |
|
|
|
28 |
The model is used to classify images into one of the 51 North American swallowtail or cattleheart butterfly species. `resnet50` was used for training.
|
29 |
|
30 |
## Intended uses & limitations
|
31 |
+
The model was trained on 8577 insect images spread over 51 species. The model is likely biased toward some species being more commonly found in certain habitats.
|
32 |
|
33 |
## Training and evaluation data
|
34 |
|
35 |
The images used in training were obtained from GBIF:
|
36 |
GBIF.org (22 June 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.bqg8bw
|
37 |
|
38 |
+
Only the first 400 images of each species (if available) were downloaded. The image set was partially cleaned for quality to remove caterpillars, poor images or butterflies that were too far away for proper ID. After "cleaning", 200 additional images were downloaded for Battus philenor and Battus polydamas (as those species had a very high percentage of caterpillar shots).
|
39 |
+
|
40 |
+
The dataset is primarily "in the wild" shots rather than all staged poses, and includes images for which even an expert would not be able to see identifying characteristics (hence the lower overall accuracy).
|
41 |
+
|
42 |
+
The image set had a minimum of 30 pics in a class for the less uncommon species (which is not enough for accurate training but they were included for completeness). 33 species had over 200 images (after cleaning).
|
43 |
|