jizhongpeng's picture

jizhongpeng

jizhongpeng

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago

liked a model 2 days ago

NexaAIDev/omnivision-968M

liked a model 3 days ago

q-future/VQA-Assistant

Organizations

jizhongpeng's activity

upvoted a paper about 13 hours ago

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Paper • 2411.13281 • Published 1 day ago • 15

upvoted a paper about 1 month ago

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8 • 107

upvoted a collection about 2 months ago

🏆 Leaderboards & Arenas 排行榜和评测基准

19 items • Updated 5 days ago • 5

upvoted a collection 3 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 15 items • Updated Sep 18 • 156

upvoted 4 papers 3 months ago

K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences

Paper • 2408.14468 • Published Aug 26 • 35

Towards flexible perception with visual memory

Paper • 2408.08172 • Published Aug 15 • 20

FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting

Paper • 2408.11706 • Published Aug 21 • 6

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

upvoted a paper 4 months ago

Q-Ground: Image Quality Grounding with Large Multi-modality Models

Paper • 2407.17035 • Published Jul 24 • 1

upvoted a collection 4 months ago

Magpie-Qwen2 Datasets

Dataset built with Qwen2 72B and Qwen2 7B. • 6 items • Updated Sep 14 • 10

upvoted a paper 4 months ago

LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding

Paper • 2407.15754 • Published Jul 22 • 19

upvoted a collection 4 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 198

upvoted a paper 4 months ago

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5 • 52

upvoted a collection 5 months ago

InternVL 2.0

Expanding Performance Boundaries of Open-Source MLLM • 16 items • Updated 1 day ago • 76

upvoted 3 papers 5 months ago

CMC-Bench: Towards a New Paradigm of Visual Signal Compression

Paper • 2406.09356 • Published Jun 13 • 4

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Paper • 2406.08407 • Published Jun 12 • 24

A-Bench: Are LMMs Masters at Evaluating AI-generated Images?

Paper • 2406.03070 • Published Jun 5 • 2

upvoted a collection 5 months ago

MaPO

This collection includes the models and datasets as a part of the MaPO release. • 9 items • Updated Jun 12 • 5

upvoted 2 collections 6 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Sep 18 • 347

GLM-4

GLM-4 Open Models • 13 items • Updated 28 days ago • 111