Nathan Habib's picture

Nathan Habib

SaylorTwift

·

AI & ML interests

None yet

Recent Activity

reacted to elliesleightholm's post with 🤗 30 minutes ago

posted an update about 2 hours ago

reacted to Symbol-LLM's post with 🔥 about 2 hours ago

Articles

Open LLM Leaderboard: DROP deep dive

What's going on with the Open LLM Leaderboard?

Organizations

SaylorTwift's activity

upvoted a paper about 2 months ago

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24 • 41

upvoted a collection about 2 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 369

upvoted a paper 2 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 136

upvoted a collection 2 months ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 57 items • Updated 28 minutes ago • 442

upvoted 4 articles 3 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22

• 85

Article

XetHub is joining Hugging Face!

Aug 8

• 80

Article

Tool Use, Unified

Aug 12

• 64

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 102

upvoted an article 6 months ago

Article

Let's talk about LLM evaluation

By

•

May 23

• 134

upvoted a collection 11 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 217

upvoted 3 papers about 1 year ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 29

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 170

upvoted a paper over 1 year ago

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

Paper • 2306.01116 • Published Jun 1, 2023 • 31