Spaces:
Running
Running
title: README | |
emoji: 🐨 | |
colorFrom: pink | |
colorTo: indigo | |
sdk: static | |
pinned: false | |
<h3 align="center">Avoid the hype, check the vibe!</h2> | |
I've cooked up Dataset Viber, a cool set of tools to make your life easier when dealing with data for AI models. Dataset Viber is all about making your data prep journey smooth and fun. It's **not for team collaboration or production**, nor trying to be all fancy and formal - just a bunch of **cool tools to help you collect feedback and do vibe-checks** as an AI engineer or lover. Want to see it in action? Just plug it in and start vibing with your data. It's that easy! | |
- **CollectorInterface**: Lazily collect data of model interactions without human annotation. | |
- **AnnotatorInterface**: Walk through your data and annotate it with models in the loop. | |
- **BulkInterface**: Explore your data distribution and annotate in bulk. | |
- **Embdedder**: Efficiently embed data with ONNX-optimized speeds. | |
Need any tweaks or want to hear more about a specific tool? Just [open an issue](https://github.com/davidberenstein1957/dataset-viber/issues/new) or give me a shout! | |
> [!NOTE] | |
> | |
> - Data is logged to a local CSV or directly to the Hugging Face Hub. | |
> - All tools also run in `.ipynb` notebooks. | |
> - Models in the loop through `fn_model`. | |
> - Input data streamers through `fn_next_input`. | |
> - It supports various tasks for `text`, `chat` and `image` modalities. | |
> - Import and export from the Hugging Face Hub or CSV files. | |
> [!TIP] | |
> Examples can be found in [src/dataset_viber/examples](https://github.com/davidberenstein1957/dataset-viber/tree/main/src/dataset_viber/examples). |