Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding Paper • 2409.03757 • Published Sep 5 • 2
Multi-task View Synthesis with Neural Radiance Fields Paper • 2309.17450 • Published Sep 29, 2023 • 3
ReferEverything: Towards Segmenting Everything We Can Speak of in Videos Paper • 2410.23287 • Published 6 days ago • 17
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Paper • 2407.06189 • Published Jul 8 • 24