Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 9 days ago • 94
Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14 • 50
Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Apr 22 • 78
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks Paper • 2402.04248 • Published Feb 6 • 30
Large Language Models as Generalizable Policies for Embodied Tasks Paper • 2310.17722 • Published Oct 26, 2023 • 6
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization Paper • 2308.02151 • Published Aug 4, 2023 • 18