File size: 7,402 Bytes
00a6117 0f6384e 2925192 0f6384e 2925192 0f6384e 97c6169 902b1c2 0f6384e a79e852 364befe f75e2b3 364befe 0f6384e 7300e5e 0f6384e 2715830 0f6384e 00a6117 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 |
---
license: mit
language:
- en
base_model:
- Ultralytics/YOLOv8
pipeline_tag: object-detection
---
W.A.L.D.O.
Whereabouts Ascertainment for Low-lying Detectable Objects
---------------------------------------------------------------------
[![WALDO 3.0 preview vid](https://i.imgur.com/hGghrLn.jpeg)](https://www.youtube.com/watch?v=1y5y9yklj2U)
Welcome to the WALDO v3.0 public release
---------------------------------------------------------------------
WHAT IS WALDO?
WALDO is a detection AI model, based on a large YOLO-v8 backbone and my own
synthetic data pipeline. **The model is capable of detecting these classes
of items in overhead imagery ranging in altitude from about 30 feet to
satellite imagery!**
Output classes:
0 -> 'LightVehicle' --> all kinds of civilan cars, including pickup trucks, vans etc... πποΈπππ </br>
1 -> 'Person' --> people! all kinds of people including ones that are on bikes or swimming in the sea π§ββοΈπΊππ§π½ββοΈππ§</br>
2 -> 'Building' --> all kinds of buildings πποΈππ‘</br>
3 -> 'UPole' --> utility poles, power poles, anything thin and sticking up that you should avoid with a drone π</br>
4 -> 'Boat' --> boats, ships, canoes, kayaks, surf boards... all the floaty stuff π’π</br>
5 -> 'Bike' --> bikes, mopeds, motorbikes, all stuff with 2 wheels π²</br>
6 -> 'Container' --> shipping containers, including on the back of an articulated truck... π¦ποΈ</br>
7 -> 'Truck' --> large commercial vehicles including articulated trucks or big box-on-chassis delivery trucks π</br>
8 -> 'Gastank'--> cylindrical tanks such as butane tanks and gas expansion tanks, or grain silos... pretty much anything that looks cylindrical for storing liquids π«</br>
10 -> 'Digger' --> all kinds of construction vehicles, including tractors and construction gear π</br>
11 -> 'Solarpanels' --> solar panels βͺοΈπβͺοΈ</br>
12 -> 'Bus' --> a bus π</br>
--> In general the lower the class number the better-trained you can expect it to be.
For users of previous versions of WALDO: note that I removed the military class and smoke detection. This is meant to be a FOSS tool for civilian use and I don't want to pursue making it work for military applications.
---------------------------------------------------------------------
WHERE IS WALDO?
Right here on HF!
Note there are a couple more models that have slightly better performance over on Gumroad here: https://6228189440665.gumroad.com/l/WALDOv3
Those are for sale as a kind of sponsorship for the project: if you find value in the free ones here you can buy those for a nice little performance boost... but it's entirey up to you!
[![P2 model performance boost](https://i.imgur.com/VKa5NN5.png)]
In both cases the actual files are MIT license and you can freely share them, so if someone gives you the ones from Gumroad you are free yo use them including commercially. It's really just a way to offset some of the work and compute that went into making this project and keeping it FOSS.
---------------------------------------------------------------------
WHAT IS IT GOOD FOR?
People are currently using versions of WALDO for:
1. disaster recovery
2. monitoring wildlife sanctuaries (intruder detection)
3. occupancy calculation (parking lots etc..)
4. monitoring infrastructure
5. construction site monitoring
6. traffic flow management
7. crowd counting
8. some fun AI art applications!
9. drone safety (avoiding people / cars on the ground)
10. lots of other fun stuff...
The main reason for me to make WALDO free has in fact been discovering all these cool applications. Let me know what you build!
---------------------------------------------------------------------
FOR AI NERDS !
It's a set of YOLOv8 model, trained on my own datasets of synthetic and "augmented" / semi-synthetic data.
I'm not going to release the dataset for the time being.
The weights are completely open, allowing you to deploy in any number of ways this time!
---------------------------------------------------------------------
HOW CAN I START WITH WALDO?
Check out the boilerplate code in the repo to run the models and output pretty detections using the wonderful Supervision annotation library from Roboflow :)
---------------------------------------------------------------------
GOING DEEPER
Of course if you know your way around deploying AI models there is a lot more you do
with this release, inclusing:
1. fine-tuning the models on your own data (if you know what you are doing, this is probably your starting point)
2. building a nicely optimized sliding-window inference setup that works nicely on your edge hardware
3. quantizing the models for super-duper edge performance on cheap devices
4. using the models to annotate your own data and train something of your own!
Enjoy!
---------------------------------------------------------------------
PREVIOUS VERSIONS
I am retiring the old versions, this is the only one that will stay online.
---------------------------------------------------------------------
CAN YOU HELP ME WITH X?
Sure, email me at [email protected]
---------------------------------------------------------------------
DETECTION OF X ISN'T WORKING AS EXPECTED:
I'd love to see example images, videos, sample data, etc at:
[email protected]
---------------------------------------------------------------------
SUPPORT WALDO!
Visit [![the WALDO gumroad page](https://t.co/kRvhYkVxW2)] to support the project!
---------------------------------------------------------------------
LICENSE
----------------------------------------------------------------------------
Unless otherwise specified all code in this release is published with the
licence conditions below.
----------------------------------------------------------------------------
MIT License
Copyright (c) 2024 Stephan Sturges / Aircortex.com
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE. |