AERO: Softmax-Only LLMs for Efficient Private Inference Paper • 2410.13060 • Published 16 days ago
ReLU's Revival: On the Entropic Overload in Normalization-Free Large Language Models Paper • 2410.09637 • Published 20 days ago