To what extent are we responsible for our content and how to create safer Spaces? about 1 month ago • 1
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 11 days ago • 192
Article Fine-tuning a token classification model for legal data using Argilla and AutoTrain By bikashpatra • 23 days ago • 11
Article Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models By isidentical • Aug 26 • 34
Gradio Annotators Collection It's not for team collaboration, nor trying to be all fancy and formal - just a bunch of cool tools to help you move to a more serious stage. • 14 items • Updated Aug 22 • 3
Article dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified By chansung • Aug 22 • 12
Probably DPO datasets Collection A collection of datasets that probably support DPO • 146 items • Updated Jun 26 • 12
Direct Preference Optimization Datasets Collection Datasets suitable for Direct Preference Optimization based on their column names • 1597 items • Updated Jul 10 • 2
Image Preference Optimization Datasets Collection Datasets suitable for Image Preference Optimization based on their column names • 4 items • Updated Jul 11 • 1
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 4 days ago • 584
Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 By dvilasuero • Jul 30 • 33
Argilla v2.0 compatible datasets Collection Ready for rg.Dataset.from_hub(). Each dataset contains a my_dataset_name/tree/main/creation_script.py that shows the full config and creation pipeline. • 7 items • Updated Aug 5 • 3
Article Experimenting with Automatic PII Detection on the Hub using Presidio Jul 10 • 23
Article Wikipedia's Treasure Trove: Advancing Machine Learning with Diverse Data By frimelle • Jun 3 • 13
Article ⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2 By burtenshaw • Jun 3 • 26
KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2 • 14
Preference Datasets for KTO Collection This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. • 5 items • Updated Jul 30 • 14
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback Paper • 2402.01391 • Published Feb 2 • 41