- Self-Rewarding Language Models
  Paper • 2401.10020 • Published • 144
- ReFT: Reasoning with Reinforced Fine-Tuning
  Paper • 2401.08967 • Published • 28
- Tuning Language Models by Proxy
  Paper • 2401.08565 • Published • 21
- TrustLLM: Trustworthiness in Large Language Models
  Paper • 2401.05561 • Published • 65
Collections including paper arxiv:2312.03700
- Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
  Paper • 2312.09390 • Published • 32
- OneLLM: One Framework to Align All Modalities with Language
  Paper • 2312.03700 • Published • 20
- Generative Multimodal Models are In-Context Learners
  Paper • 2312.13286 • Published • 34
- The LLM Surgeon
  Paper • 2312.17244 • Published • 9

- OneLLM: One Framework to Align All Modalities with Language
  Paper • 2312.03700 • Published • 20
- Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
  Paper • 2402.03162 • Published • 17
- Rolling Diffusion Models
  Paper • 2402.09470 • Published • 10
- AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
  Paper • 2402.12226 • Published • 41

- Trusted Source Alignment in Large Language Models
  Paper • 2311.06697 • Published • 10
- Diffusion Model Alignment Using Direct Preference Optimization
  Paper • 2311.12908 • Published • 47
- SuperHF: Supervised Iterative Learning from Human Feedback
  Paper • 2310.16763 • Published • 1
- Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
  Paper • 2311.15657 • Published • 2

- Random Field Augmentations for Self-Supervised Representation Learning
  Paper • 2311.03629 • Published • 6
- TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models
  Paper • 2311.04589 • Published • 18
- GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs
  Paper • 2311.04901 • Published • 7
- Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models
  Paper • 2311.06783 • Published • 26

- Levels of AGI: Operationalizing Progress on the Path to AGI
  Paper • 2311.02462 • Published • 33
- Ultra-Long Sequence Distributed Transformer
  Paper • 2311.02382 • Published • 2
- A Survey on Language Models for Code
  Paper • 2311.07989 • Published • 21
- GRIM: GRaph-based Interactive narrative visualization for gaMes
  Paper • 2311.09213 • Published • 12

- Attention Is All You Need
  Paper • 1706.03762 • Published • 44
- FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
  Paper • 2307.08691 • Published • 8
- Mixtral of Experts
  Paper • 2401.04088 • Published • 159
- Mistral 7B
  Paper • 2310.06825 • Published • 47

- Woodpecker: Hallucination Correction for Multimodal Large Language Models
  Paper • 2310.16045 • Published • 14
- HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
  Paper • 2310.14566 • Published • 25
- SILC: Improving Vision Language Pretraining with Self-Distillation
  Paper • 2310.13355 • Published • 7
- Conditional Diffusion Distillation
  Paper • 2310.01407 • Published • 20

- MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
  Paper • 2310.09478 • Published • 19
- Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V
  Paper • 2310.19061 • Published • 8
- OneLLM: One Framework to Align All Modalities with Language
  Paper • 2312.03700 • Published • 20