LiveBench: A Challenging, Contamination-Free LLM Benchmark Paper • 2406.19314 • Published Jun 27 • 19
Transformers Can Do Arithmetic with the Right Embeddings Paper • 2405.17399 • Published May 27 • 51
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text Paper • 2401.12070 • Published Jan 22 • 43
Perspectives on the State and Future of Deep Learning -- 2023 Paper • 2312.09323 • Published Dec 7, 2023 • 5
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks Paper • 2310.19909 • Published Oct 30, 2023 • 20
Bring Your Own Data! Self-Supervised Evaluation for Large Language Models Paper • 2306.13651 • Published Jun 23, 2023 • 15
InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models Paper • 2306.03082 • Published Jun 5, 2023 • 5
Understanding and Mitigating Copying in Diffusion Models Paper • 2305.20086 • Published May 31, 2023 • 3
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust Paper • 2305.20030 • Published May 31, 2023 • 8