1 33 199

Ombeline Minelle PRO

OmbelineM

AI & ML interests

Multimodal, creative use of technology, intuitive interface UX / UI, biomimetics and neuroscience

Recent Activity

updated a collection about 12 hours ago

Datasets

liked a Space about 12 hours ago

data-is-better-together/prompt-collective

Reacted to fdaudens's post with ➕ about 12 hours ago

Just tested Argilla's new data annotation feature - it's a game changer for AI project quality. Upload CSVs, work with published datasets, or improve existing ones directly on HuggingFace Hub. Setup took < 2 minutes, no code needed (see example below where I selected a dataset to classify tweets in categories). Real world impact: Missing in Chicago won a Pulitzer using a similar approach - 200 volunteers labeled police misconduct files to train their model. That's the power of good data annotation. Three immediate use cases I see: - Build collaborative training sets with your community (surprisingly underused in AI journalism) - Turn your website chatbot logs into high-quality fine-tuning data - Compare generated vs published content (great for SEO headlines) Works for solo projects or teams up to 100 people. All integrated with HuggingFace Hub for immediate model training. Interesting to see tools like this making data quality more accessible. Data quality is the hidden driver of AI success that we don't talk about enough. - Check out the blogpost: https://huggingface.co/blog/argilla-ui-hub - And the quickstart guide: https://docs.argilla.io/latest/getting_started/quickstart/

View all activity

Organizations

OmbelineM's activity

upvoted a paper 5 days ago

Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms

Paper • 2410.23144 • Published 24 days ago • 4

upvoted a collection 5 days ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated 2 days ago • 174

upvoted a paper 11 days ago

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8 • 159

upvoted a collection 11 days ago

Comparator

Collection

1 item • Updated 11 days ago • 1

upvoted 3 collections 17 days ago

upvoted 8 papers 24 days ago

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Paper • 2311.05556 • Published Nov 9, 2023 • 81

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24 • 57

LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance

Paper • 2307.00522 • Published Jul 2, 2023 • 32

Learning and Leveraging World Models in Visual Representation Learning

Paper • 2403.00504 • Published Mar 1 • 31

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 184

Training Transformers Together

Paper • 2207.03481 • Published Jul 7, 2022 • 5

Probing the 3D Awareness of Visual Foundation Models

Paper • 2404.08636 • Published Apr 12 • 12

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

Paper • 2407.03471 • Published Jul 3 • 28

upvoted an article 25 days ago

Article

Breaking Barriers: The Critical Role of Art and Design in Advancing AI Capabilities

•

Jan 15

• 3

upvoted 4 papers 25 days ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 86

LEDITS++: Limitless Image Editing using Text-to-Image Models

Paper • 2311.16711 • Published Nov 28, 2023 • 22

TripoSR: Fast 3D Object Reconstruction from a Single Image

Paper • 2403.02151 • Published Mar 4 • 12

Stable Audio Open

Paper • 2407.14358 • Published Jul 19 • 23