Studying Large Language Model Generalization with Influence Functions Paper • 2308.03296 • Published Aug 7, 2023 • 12
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models Paper • 2411.12580 • Published 2 days ago • 1
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 6 days ago • 87
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs Paper • 2410.16144 • Published Oct 21, 2024 • 2
Neural Tangent Kernel: Convergence and Generalization in Neural Networks Paper • 1806.07572 • Published Jun 20, 2018 • 1
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Paper • 2411.09595 • Published 7 days ago • 65
BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion Paper • 2403.06976 • Published Mar 11, 2024 • 2
MagicQuill: An Intelligent Interactive Image Editing System Paper • 2411.09703 • Published 7 days ago • 50
Faster Algorithms for Text-to-Pattern Hamming Distances Paper • 2310.13174 • Published Oct 19, 2023 • 1
Physics of Language Models: Part 3.2, Knowledge Manipulation Paper • 2309.14402 • Published Sep 25, 2023 • 6
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process Paper • 2407.20311 • Published Jul 29, 2024 • 4
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models Paper • 2410.05229 • Published Oct 7, 2024 • 18