- CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy
  Paper • 2410.13218 • Published • 4
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
  Paper • 2410.10818 • Published • 14
- Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations
  Paper • 2410.08049 • Published • 8
- Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
  Paper • 2410.05363 • Published • 44
Preston Mann
Hermeskid123
AI & ML interests
None yet
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet