arxiv:2411.10323
Mike Zheng Shou
AnalMom
AI & ML interests
None yet
Recent Activity
authored
a paper
6 days ago
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer
Use
authored
a paper
about 1 month ago
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large
Vision-Language Models
authored
a paper
about 2 months ago
One Token to Seg Them All: Language Instructed Reasoning Segmentation in
Videos
Organizations
None yet
models
None public yet
datasets
None public yet