Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Paper • 2411.14432 • Published 6 days ago • 19
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent Paper • 2411.02265 • Published 23 days ago • 24
Post: 🔥🔥🔥 Introducing Oryx-1.5! A series of unified MLLMs with much stronger performance on all the image, video, and 3D benchmarks 😍
🛠️ GitHub: https://github.com/Oryx-mllm/Oryx
🚀 Model: THUdyh/oryx-15-6718c60763845525c2bba71d
🎨 Demo: THUdyh/Oryx
👋 Try the top-tier MLLM yourself!
👀 Stay tuned for more explorations on MLLMs!
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published Sep 25 • 103
Unleashing Text-to-Image Diffusion Models for Visual Perception Paper • 2303.02153 • Published Mar 3, 2023
Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution Paper • 2409.12961 • Published Sep 19 • 24
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation Paper • 2409.03755 • Published Sep 5 • 3
Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model Paper • 2408.00754 • Published Aug 1 • 21
Efficient Inference of Vision Instruction-Following Models with Elastic Cache Paper • 2407.18121 • Published Jul 25 • 16