liuzuyan

Zuyan

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Organizations

None yet

Zuyan's activity

upvoted a paper 6 days ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published 6 days ago • 18

upvoted a paper 23 days ago

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Paper • 2411.02265 • Published 23 days ago • 24

upvoted 3 papers 2 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25 • 103

DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation

Paper • 2409.03755 • Published Sep 5 • 3

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19 • 24

upvoted 2 papers 4 months ago

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Paper • 2408.00754 • Published Aug 1 • 21

Efficient Inference of Vision Instruction-Following Models with Elastic Cache

Paper • 2407.18121 • Published Jul 25 • 16