Rui Sun's picture

6 1

Rui Sun

ThreeSR

·

https://threesr.github.io/

AI & ML interests

Vision and Language Multimodal Learning, CV, NLP, LLM

Recent Activity

upvoted a paper 3 minutes ago

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

upvoted a paper 20 days ago

Training-free Regional Prompting for Diffusion Transformers

upvoted a paper 20 days ago

How Far is Video Generation from World Model: A Physical Law Perspective

View all activity

Organizations

ThreeSR's activity

upvoted a paper 3 minutes ago

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

Paper • 2411.14347 • Published 4 days ago • 8

upvoted 2 papers 20 days ago

Training-free Regional Prompting for Diffusion Transformers

Paper • 2411.02395 • Published 21 days ago • 24

How Far is Video Generation from World Model: A Physical Law Perspective

Paper • 2411.02385 • Published 21 days ago • 32

upvoted a paper 21 days ago

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published 26 days ago • 46

upvoted a paper 26 days ago

Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning

Paper • 2410.22304 • Published 27 days ago • 15

upvoted a paper about 1 year ago

GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs

Paper • 2311.04901 • Published Nov 8, 2023 • 7