Data Engineering for Scaling Language Models to 128K Context Paper • 2402.10171 • Published Feb 15 • 22
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21 • 112
Tulu V2 Suite Collection The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" • 19 items • Updated 7 days ago • 42
🚀GGUF Collection Llama.cpp compatible models, can be used on CPUs and GPUs! • 871 items • Updated 6 days ago • 35
BLING Models Collection Small CPU-based RAG-optimized, instruct-following 1B-3B parameter models • 27 items • Updated 25 days ago • 25
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 505