MaskBit: Embedding-free Image Generation via Bit Tokens Paper • 2409.16211 • Published Sep 24 • 16
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline Paper • 2408.15079 • Published Aug 27 • 52
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22 • 126
OneBit: Towards Extremely Low-bit Large Language Models Paper • 2402.11295 • Published Feb 17 • 23
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 138
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs Paper • 2311.09257 • Published Nov 14, 2023 • 45
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference Paper • 2310.04378 • Published Oct 6, 2023 • 19
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models Paper • 2309.14717 • Published Sep 26, 2023 • 44
Emergence of Segmentation with Minimalistic White-Box Transformers Paper • 2308.16271 • Published Aug 30, 2023 • 13
BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping Paper • 2306.05544 • Published Jun 8, 2023 • 10
Divide & Bind Your Attention for Improved Generative Semantic Nursing Paper • 2307.10864 • Published Jul 20, 2023 • 2
Wuerstchen: Efficient Pretraining of Text-to-Image Models Paper • 2306.00637 • Published Jun 1, 2023 • 12