zhang zhixiong

rookiexiong

rookiexiong7

AI & ML interests

None yet

Organizations

None yet

rookiexiong's activity

liked a model about 7 hours ago

internlm/internlm-xcomposer2d5-ol-7b

Visual Question Answering • Updated about 11 hours ago • 13

upvoted a paper about 15 hours ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 1 day ago • 58

upvoted a paper 9 days ago

Imagine360: Immersive 360 Video Generation from Perspective Anchor

Paper • 2412.03552 • Published 9 days ago • 26

upvoted a paper 10 days ago

X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

Paper • 2412.01824 • Published 11 days ago • 61

upvoted 3 papers about 2 months ago

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23 • 34

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Paper • 2410.17247 • Published Oct 22 • 45

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21 • 65