Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published 5 days ago • 41
Releasing the largest multilingual open pretraining dataset Article • By Pclanglais • 17 days ago • 96