Kimiko

Chat-Error

Chat-Error's activity

upvoted a paper 7 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 84

upvoted a paper 8 months ago

LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model

Paper • 2404.01331 • Published Mar 29 • 25

upvoted a paper 9 months ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 603

upvoted 2 papers 10 months ago

Rethinking Optimization and Architecture for Tiny Language Models

Paper • 2402.02791 • Published Feb 5 • 12

SPT: Fine-Tuning Transformer-based Language Models Efficiently with Sparsification

Paper • 2312.10365 • Published Dec 16, 2023 • 1

upvoted 2 papers 11 months ago

Designing a Better Asymmetric VQGAN for StableDiffusion

Paper • 2306.04632 • Published Jun 7, 2023 • 3

Pretraining on the Test Set Is All You Need

Paper • 2309.08632 • Published Sep 13, 2023 • 3

upvoted a paper about 1 year ago

FLM-101B: An Open LLM and How to Train It with $100K Budget

Paper • 2309.03852 • Published Sep 7, 2023 • 44

upvoted a paper over 1 year ago

Extending Context Window of Large Language Models via Positional Interpolation

Paper • 2306.15595 • Published Jun 27, 2023 • 53