Yoshi Suhara's picture

3 3 5

Yoshi Suhara

suhara

·

https://yoshi-suhara.com/

AI & ML interests

None yet

Recent Activity

New activity about 1 month ago

nvidia/Mistral-NeMo-Minitron-8B-Instruct

New activity about 1 month ago

nvidia/Mistral-NeMo-Minitron-8B-Instruct

liked a model about 2 months ago

nvidia/Mistral-NeMo-Minitron-8B-Instruct

Organizations

suhara's activity

upvoted a paper about 2 months ago

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

Paper • 2409.17481 • Published Sep 26 • 46

upvoted a collection 3 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated Oct 3 • 59

upvoted a paper 3 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21 • 55