Jordan Gong's picture

43 46

Jordan Gong

jordangong

·

AI & ML interests

None yet

Organizations

None yet

jordangong's activity

upvoted a collection about 1 month ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 12 days ago • 440

upvoted 3 collections about 2 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 14 items • Updated Sep 25 • 83

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 9 items • Updated Sep 23 • 41

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 301

upvoted a collection 3 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Sep 25 • 609

upvoted 4 collections 4 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 187

Tulu V2.5 Suite

A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more! • 44 items • Updated 22 days ago • 14

Gemma 2 Release

15 items • Updated Sep 9 • 192

DeepSeekCoder-V2

6 items • Updated Sep 5 • 82

upvoted 3 collections 5 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Sep 18 • 346

GLM-4

GLM-4 Open Models • 13 items • Updated 11 days ago • 110

K2

K2, LLM360's most powerful, scaled model series. • 7 items • Updated 30 days ago • 7

upvoted a collection 6 months ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 31 • 137

upvoted an article 6 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14

• 204

upvoted 5 collections 6 months ago

OLMo Suite

Artifacts for the first set of OLMo models. • 18 items • Updated Sep 25 • 65

Yi-1.5 (2024/05)

10 items • Updated May 20 • 90

Mantis

Mantis model family optimized for multi-image reasoning with interleaved text/image format • 11 items • Updated Jul 2 • 8

VILA: On Pre-training for Visual Language Models

10 items • Updated 6 days ago • 45

Granite Code Models

A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 1 day ago • 175

upvoted a collection 7 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 6 days ago • 486