Best of Both Worlds: Advantages of Hybrid Graph Sequence Models Paper • 2411.15671 • Published 3 days ago • 5
Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry Paper • 2411.15221 • Published 6 days ago • 11
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline Paper • 2411.12814 • Published 7 days ago • 10
Material Anything: Generating Materials for Any 3D Object via Diffusion Paper • 2411.15138 • Published 4 days ago • 35
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Paper • 2411.15466 • Published 4 days ago • 28
Knowledge Transfer Across Modalities with Natural Language Supervision Paper • 2411.15611 • Published 3 days ago • 13
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge Paper • 2411.16594 • Published 1 day ago • 19
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation Paper • 2411.16657 • Published 1 day ago • 13
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published 1 day ago • 19
Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published 2 days ago • 6
SegBook: A Simple Baseline and Cookbook for Volumetric Medical Image Segmentation Paper • 2411.14525 • Published 5 days ago • 8
OminiControl: Minimal and Universal Control for Diffusion Transformer Paper • 2411.15098 • Published 4 days ago • 34
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games Paper • 2411.13543 • Published 6 days ago • 16
VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement Paper • 2411.15115 • Published 4 days ago • 7