CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy Paper • 2410.13218 • Published 15 days ago • 4
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Paper • 2410.10818 • Published 18 days ago • 14
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations Paper • 2410.08049 • Published 22 days ago • 8
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation Paper • 2410.05363 • Published 25 days ago • 44
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Paper • 2410.02073 • Published 30 days ago • 39
Learning the Latent Rules of a Game from Data: A Chess Story Paper • 2410.02426 • Published 29 days ago • 5
Self-Supervised Any-Point Tracking by Contrastive Random Walks Paper • 2409.16288 • Published Sep 24 • 5
Evaluating Multiview Object Consistency in Humans and Image Models Paper • 2409.05862 • Published Sep 9 • 8