109 13 25

Raushan Turganbay

RaushanTurganbay

zucchini-nlp

AI & ML interests

Generation and Multimodality

Articles

Unlocking Longer Generation with Key-Value Cache Quantization

May 16

• 28

Organizations

RaushanTurganbay's activity

upvoted a collection 5 days ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 5 days ago • 198

upvoted an article 21 days ago

Article

Key Insights into the Law of Vision Representations in MLLMs

•

29 days ago

• 16

upvoted a paper 21 days ago

Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance

Paper • 2409.04593 • Published 25 days ago • 20

upvoted a collection about 1 month ago

Vision Language Models Papers 🖼️💬📝

Collection

Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30 • 32

upvoted an article about 2 months ago

Article

Introduction to ggml

Aug 13

• 96

upvoted 3 papers about 2 months ago

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

Paper • 2408.04840 • Published Aug 9 • 31

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Paper • 2408.05211 • Published Aug 9 • 46

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

upvoted a paper 2 months ago

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Paper • 2407.15841 • Published Jul 22 • 38

upvoted 2 papers 3 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 153

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Paper • 2407.06189 • Published Jul 8 • 24

upvoted an article 4 months ago

Article

AI has a problem with objectifying women

•

May 24

• 54

upvoted a paper 8 months ago

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Paper • 2402.08093 • Published Feb 12 • 54