Ye Fang

aleafy

AI & ML interests

None yet

Organizations

None yet

aleafy's activity

upvoted a paper 9 days ago

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published 11 days ago • 34

upvoted a paper 11 days ago

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Paper • 2410.17247 • Published 11 days ago • 43

upvoted a paper 12 days ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published 12 days ago • 65

upvoted a paper 4 months ago

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3 • 92

upvoted a paper 6 months ago

Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials

Paper • 2404.16829 • Published Apr 25 • 5

upvoted a paper 9 months ago

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

Paper • 2401.16420 • Published Jan 29 • 54

upvoted 2 papers 10 months ago

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

Paper • 2401.04092 • Published Jan 8 • 20

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

Paper • 2312.15011 • Published Dec 22, 2023 • 15

upvoted 2 papers 11 months ago

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Paper • 2312.03818 • Published Dec 6, 2023 • 32

GPT4Point: A Unified Framework for Point-Language Understanding and Generation

Paper • 2312.02980 • Published Dec 5, 2023 • 7