CogVLM2: Visual Language Models for Image and Video Understanding Paper • 2408.16500 • Published 20 days ago • 55
Learning to Move Like Professional Counter-Strike Players Paper • 2408.13934 • Published 23 days ago • 21
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published 26 days ago • 109
Towards a Unified View of Preference Learning for Large Language Models: A Survey Paper • 2409.02795 • Published 13 days ago • 67
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper • 2409.08264 • Published 5 days ago • 35