Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published 2 days ago • 30
Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 14 days ago • 95