- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection — arXiv:2403.03507, published Mar 6
- Efficient Language Adaptive Pre-training: Extending State-of-the-Art Large Language Models for Polish — arXiv:2402.09759, published Feb 15
- MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts — arXiv:2401.04081, published Jan 8