Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders Paper ā¢ 2410.22366 ā¢ Published 9 days ago ā¢ 71
Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization Paper ā¢ 2409.00492 ā¢ Published Aug 31 ā¢ 11
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning Paper ā¢ 2406.08973 ā¢ Published Jun 13 ā¢ 85
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases ā¢ 5 items ā¢ Updated Sep 25 ā¢ 680
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper ā¢ 2403.03507 ā¢ Published Mar 6 ā¢ 182
Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning Paper ā¢ 2404.03323 ā¢ Published Apr 4 ā¢ 3
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper ā¢ 2312.00752 ā¢ Published Dec 1, 2023 ā¢ 138
Synth^2: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings Paper ā¢ 2403.07750 ā¢ Published Mar 12 ā¢ 21
DeepSeek-VL: Towards Real-World Vision-Language Understanding Paper ā¢ 2403.05525 ā¢ Published Mar 8 ā¢ 39
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks Paper ā¢ 2403.00522 ā¢ Published Mar 1 ā¢ 44
FiT: Flexible Vision Transformer for Diffusion Model Paper ā¢ 2402.12376 ā¢ Published Feb 19 ā¢ 48