wongyukim

kimwongyuda

AI & ML interests

None yet

Recent Activity

upvoted a paper about 17 hours ago

Patience Is The Key to Large Language Model Reasoning

upvoted a paper about 17 hours ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

upvoted a paper about 17 hours ago

Natural Language Reinforcement Learning

View all activity

Organizations

None yet

wongyukim's activity

upvoted 6 papers about 17 hours ago

Patience Is The Key to Large Language Model Reasoning

Paper • 2411.13082 • Published 5 days ago • 5

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published 3 days ago • 16

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published 9 days ago • 53

upvoted 7 papers 3 days ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published 3 days ago • 35

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Paper • 2411.06559 • Published 14 days ago • 10

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published 4 days ago • 12

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Paper • 2411.13281 • Published 4 days ago • 15

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published 4 days ago • 24

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published 8 days ago • 44

MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs

Paper • 2411.02571 • Published 20 days ago • 1

upvoted 4 papers 4 days ago

SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning

Paper • 2411.10161 • Published 9 days ago • 6

ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements

Paper • 2411.12044 • Published 6 days ago • 13

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published 6 days ago • 44

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization

Paper • 2411.11909 • Published 8 days ago • 20

upvoted 2 papers 5 days ago

Drowning in Documents: Consequences of Scaling Reranker Inference

Paper • 2411.11767 • Published 6 days ago • 16

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published 9 days ago • 39

upvoted a paper 6 days ago

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Paper • 2411.10323 • Published 9 days ago • 27