Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper • 2409.08264 • Published 20 days ago • 42
WebInstruct 🌐 Embeddings 🧱 Models Collection A collection of SoTA embeddings model fine-tuned on WebInstruct dataset to learn to pair instructions with its responses • 3 items • Updated 28 days ago • 11
view article Article Selective fine-tuning of Language Models with Spectrum By anakin87 • 29 days ago • 26
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 41
Mixture-of-preference-reward-modeling Collection The mixture of preference datasets used for reward modeling. • 2 items • Updated Apr 29 • 2
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8 • 19
Preference Datasets for DPO Collection This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Jul 30 • 28
An Emulator for Fine-Tuning Large Language Models using Small Language Models Paper • 2310.12962 • Published Oct 19, 2023 • 14
🚂 SD-XL Training Suite Collection All the steps to train your own SD-XL custom model • 7 items • Updated Jun 11 • 20
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Paper • 2307.06949 • Published Jul 13, 2023 • 50