Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper β’ 2409.17146 β’ Published Sep 25 β’ 103
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images Paper β’ 2406.13735 β’ Published Jun 19 β’ 5
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing Paper β’ 2406.10601 β’ Published Jun 15 β’ 65