Optimizations Exponentially Faster Language Modelling Paper • 2311.10770 • Published Nov 15, 2023 • 118 Scaling Data-Constrained Language Models Paper • 2305.16264 • Published May 25, 2023 • 17 LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 258
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 258