HyenaDNA Models Collection HyenaDNA models usable directly with Hugging Face classes like AutoModel. • 8 items • Updated Nov 14, 2023 • 15
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 4 days ago • 69
Qwen 2.5 Coder Collection Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. • 35 items • Updated 4 days ago • 17
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated about 22 hours ago • 209
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published 12 days ago • 105
view article Article Glaze and the Effectiveness of Anti-AI Methods for Diffusion Models By parsee-mizuhashi • May 15 • 7
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper • 2309.10400 • Published Sep 19, 2023 • 26
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Paper • 2411.02355 • Published 15 days ago • 44
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 84
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping Paper • 2402.14083 • Published Feb 21 • 47
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 12 days ago • 95
MoE Girl Collection The MoE Girl series of small, sparse roleplay models • 3 items • Updated 23 days ago • 2
RPMax v1 Models Collection RPMax series of models with higher creativity and reduced repetition for "classic" RP chats. • 15 items • Updated about 13 hours ago • 15
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant Paper • 2410.15316 • Published about 1 month ago • 10
AutoTrain: No-code training for state-of-the-art models Paper • 2410.15735 • Published 29 days ago • 57
view article Article Advanced Flux Dreambooth LoRA Training with 🧨 diffusers By linoyts • 29 days ago • 27