AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models Paper • 2406.10900 • Published 15 days ago • 4 • 4
MotionBooth: Motion-Aware Customized Text-to-Video Generation Paper • 2406.17758 • Published 5 days ago • 15 • 1
Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models Paper • 2406.14599 • Published 10 days ago • 16 • 2
ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning Paper • 2406.14130 • Published 11 days ago • 10 • 2
JEN-1 DreamStyler: Customized Musical Concept Learning via Pivotal Parameters Tuning Paper • 2406.12292 • Published 13 days ago • 4 • 2
VoCo-LLaMA: Towards Vision Compression with Large Language Models Paper • 2406.12275 • Published 13 days ago • 28 • 10
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools Paper • 2406.12793 • Published 12 days ago • 26 • 2
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI Paper • 2406.12753 • Published 13 days ago • 14 • 2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Paper • 2406.11931 • Published 14 days ago • 54 • 3
Training-free Camera Control for Video Generation Paper • 2406.10126 • Published 17 days ago • 11 • 2
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery Paper • 2406.08587 • Published 18 days ago • 14 • 4
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning Paper • 2406.09170 • Published 18 days ago • 23 • 1
Interpreting the Weight Space of Customized Diffusion Models Paper • 2406.09413 • Published 17 days ago • 18 • 1
OpenVLA: An Open-Source Vision-Language-Action Model Paper • 2406.09246 • Published 18 days ago • 28 • 1
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels Paper • 2406.09415 • Published 17 days ago • 47 • 2
HelpSteer2: Open-source dataset for training top-performing reward models Paper • 2406.08673 • Published 18 days ago • 14 • 3
Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs Paper • 2406.08657 • Published 18 days ago • 9 • 2
DiTFastAttn: Attention Compression for Diffusion Transformer Models Paper • 2406.08552 • Published 18 days ago • 20 • 1