OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs • Paper 2409.05152 • Published 27 days ago
ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning • Paper 2408.03402 • Published Aug 6
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages • Paper 2309.09400 • Published Sep 17, 2023