Allan D Clive's picture

Allan D Clive

allandclive

·

allan_d_clive

AI & ML interests

ASR & TTS

Organizations

allandclive's activity

upvoted a collection 19 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 15 days ago • 168

upvoted a collection about 2 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 4 days ago • 271

upvoted 2 collections 2 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 355

Open Whisper-style Speech Models (OWSM)

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 15 items • Updated Sep 27 • 3

upvoted 6 collections 3 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 15 items • Updated Sep 18 • 155

YOLOv10

This collection hosts the YOLOv10 model releases • 16 items • Updated Jun 3 • 16

⛈️ Llama-3.1 Storm Models

Fine-tuned Llama 3.1 8B model with superior reasoning, conversation abilities, and function calling! • 3 items • Updated Aug 25 • 15

Hermes 3

The Hermes 3 Series of Models • 8 items • Updated Aug 23 • 90

Qwen2-Audio

Audio-language model series based on Qwen2 • 4 items • Updated Sep 18 • 44

Qwen2-Math

Math-specific model series based on Qwen2 • 8 items • Updated Sep 18 • 45

upvoted 3 collections 4 months ago

Gemma 2 2B Release

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 31 • 76

Llama 3.1 GPTQ, AWQ, and BNB Quants

Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Sep 26 • 54

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 63

upvoted a paper 4 months ago

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15 • 55

upvoted a collection 4 months ago

LLaVa-Interleave

LLaVa models that extends the model capabilities to Multi-image, Multi-frame (videos), Multi-patch (single-image) scenarios. • 3 items • Updated Jul 10 • 14

upvoted an article 4 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 263

upvoted 4 collections 4 months ago

InternVL 2.0

Expanding Performance Boundaries of Open-Source MLLM • 16 items • Updated 29 days ago • 76

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 198

BigVGAN

BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input. • 11 items • Updated Oct 1 • 10

H2O Danube3

6 items • Updated Oct 17 • 53