2 1 2

Tom Goldstein

tomgoldstein

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago

tomg-group-umd/CinePileLeaderboard

liked a model about 1 month ago

mfarre/Video-LLaVA-7B-hf-CinePile

View all activity

Organizations

tomgoldstein's activity

liked a Space about 1 month ago

Running

🔥

CinePileLeaderboard

Video-LLM evaluations on CinePile's evaluation split.

liked a model about 1 month ago

mfarre/Video-LLaVA-7B-hf-CinePile

Video-Text-to-Text • Updated Aug 24 • 13 • 29

authored a paper 5 months ago

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Paper • 2406.19314 • Published Jun 27 • 19

authored a paper 6 months ago

Transformers Can Do Arithmetic with the Right Embeddings

Paper • 2405.17399 • Published May 27 • 51

authored 2 papers 10 months ago

ODIN: Disentangled Reward Mitigates Hacking in RLHF

Paper • 2402.07319 • Published Feb 11 • 13

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Paper • 2401.12070 • Published Jan 22 • 43

authored a paper 11 months ago

Perspectives on the State and Future of Deep Learning -- 2023

Paper • 2312.09323 • Published Dec 7, 2023 • 5

authored a paper 12 months ago

Object Recognition as Next Token Prediction

Paper • 2312.02142 • Published Dec 4, 2023 • 11

authored a paper about 1 year ago

Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks

Paper • 2310.19909 • Published Oct 30, 2023 • 20

authored 4 papers over 1 year ago

Bring Your Own Data! Self-Supervised Evaluation for Large Language Models

Paper • 2306.13651 • Published Jun 23, 2023 • 15

InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models

Paper • 2306.03082 • Published Jun 5, 2023 • 5

Understanding and Mitigating Copying in Diffusion Models

Paper • 2305.20086 • Published May 31, 2023 • 3

Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust

Paper • 2305.20030 • Published May 31, 2023 • 8