RustyTake-Off's picture

RustyTake-Off

RustyTake-Off

·

RustyTake-Off

AI & ML interests

I'll have what I'm having 🍩

Organizations

None yet

RustyTake-Off's activity

upvoted a collection 3 days ago

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 6 items • Updated 3 days ago • 65

upvoted 2 collections 12 days ago

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 12 items • Updated about 1 month ago • 155

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 57 items • Updated about 4 hours ago • 438

upvoted 3 collections 13 days ago

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 45

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 197

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 12 days ago • 167

upvoted 2 collections 19 days ago

C4AI Command R

C4AI Command-R is a research release of a 35 billion parameter highly performant generative model. Command-R is a large language model with open weigh • 4 items • Updated Aug 30 • 19

C4AI Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 23 days ago • 26

upvoted 2 collections 22 days ago

C4AI Aya 23

Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 4 items • Updated Aug 6 • 50

C4AI Command R Plus

C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities. • 4 items • Updated Aug 30 • 54

upvoted a collection 24 days ago

Stable Diffusion 3.5

6 items • Updated 18 days ago • 91

upvoted 2 collections 25 days ago

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 12 days ago • 87

Granite Code Models

A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 12 days ago • 178

upvoted 3 collections about 1 month ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 136

MiniCPM

The MiniCPM family of LLMs and VLLMs. • 31 items • Updated 25 days ago • 54

Whisper Release

Whisper includes both English-only and multilingual checkpoints for ASR and ST, ranging from 38M params for the tiny models to 1.5B params for large. • 12 items • Updated Sep 13, 2023 • 87

upvoted 4 collections about 2 months ago

NV-Embed

NV-Embed is a generalist embedding model encompassing retrieval, reranking, classification, clustering, STS tasks. • 3 items • Updated 17 days ago • 9

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 2 days ago • 271

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 9 items • Updated Sep 23 • 45

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 344