DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference Paper • 2401.08671 • Published Jan 9 • 14
DePlot: One-shot visual language reasoning by plot-to-table translation Paper • 2212.10505 • Published Dec 20, 2022 • 1
Natural Language Inference over Interaction Space: ICLR 2018 Reproducibility Report Paper • 1802.03198 • Published Feb 9, 2018 • 1
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models Paper • 2404.07004 • Published Apr 10 • 6
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models Paper • 2404.07839 • Published Apr 11 • 43