VLMs for 3D reconstructions and their evaluation - a cviitm Collection

cviitm 's Collections

VLMs for 3D reconstructions and their evaluation

VLMs for 3D reconstructions and their evaluation

updated Dec 5, 2023

List of papers to help with developing a model that reviews a photogrammetry scan and evaluates its quality

ImageBind: One Embedding Space To Bind Them All

Paper • 2305.05665 • Published May 9, 2023 • 3
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth

Paper • 2302.12288 • Published Feb 23, 2023
HuggingFaceM4/howto100m

Updated May 18, 2022 • 38 • 4
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Paper • 2201.12086 • Published Jan 28, 2022 • 3
sayakpaul/nyu_depth_v2

Viewer • Updated Dec 12, 2022 • 3.75k • 836 • 25
iejMac/CLIP-MSR-VTT

Preview • Updated Oct 31, 2022 • 58
playgroundai/blip_clipseg_inpainting_ip2p_data_test

Viewer • Updated Feb 8, 2023 • 825 • 46 • 4
Improved Baselines with Visual Instruction Tuning

Paper • 2310.03744 • Published Oct 5, 2023 • 37
Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Paper • 2309.10020 • Published Sep 18, 2023 • 40
The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)

Paper • 2309.17421 • Published Sep 29, 2023 • 4
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 26