arxiv:2410.01518
Minsoo Kim
minsoo2333
AI & ML interests
LLM compression
Recent Activity
authored a paper about 2 months ago
Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization
authored a paper about 2 months ago
Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment
authored a paper about 2 months ago
InfiniPot: Infinite Context Processing on Memory-Constrained LLMs
Organizations
None yet
Models
None public yet
Datasets
None public yet