Dotanoob7's picture

1 10 1

Dotanoob7

Dotanoob

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

upvoted a paper 16 days ago

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

upvoted a paper 16 days ago

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

View all activity

Organizations

None yet

Dotanoob's activity

upvoted a paper 3 days ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published 6 days ago • 36

upvoted 2 papers 16 days ago

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published 16 days ago • 45

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published 20 days ago • 35

upvoted a paper 17 days ago

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published 20 days ago • 109

upvoted 6 papers about 1 month ago

Teach Multimodal LLMs to Comprehend Electrocardiographic Images

Paper • 2410.19008 • Published Oct 21 • 22

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18 • 74

NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples

Paper • 2410.14669 • Published Oct 18 • 35

HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks

Paper • 2410.12381 • Published Oct 16 • 42

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10 • 26

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14 • 37