Vaibhav Srivastav's picture

Vaibhav Srivastav

reach-vb

·

https://vaibhavs10.github.io

AI & ML interests

TTS + LM performance prediction

Recent Activity

New activity 32 minutes ago

reach-vb/test-gating-f

updated a model 32 minutes ago

reach-vb/test-gating-f

New activity 33 minutes ago

reach-vb/test-gating-f

Articles

Faster Text Generation with Self-Speculative Decoding

Llama can now see and run on your device - welcome Llama 3.2

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

WWDC 24: Running Mistral 7B with Core ML

Welcome Gemma 2 - Google's new open LLM

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

CodeGemma - an official Google release for code LLMs

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

AI Watermarking 101: Tools and Techniques

Deploy MusicGen in no time with Inference Endpoints

Jupyter X Hugging Face

Swift Diffusers: Fast Stable Diffusion for Mac

Organizations

reach-vb's activity

upvoted a paper 2 days ago

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published 6 days ago • 37

upvoted 3 collections 4 days ago

Athene-V2

2 items • Updated 7 days ago • 7

LLM2CLIP

LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 7 items • Updated 2 days ago • 35

🔍 Daily Picks in Interpretability & Analysis of LMs

Outstanding research in interpretability and evaluation of language models, summarized • 82 items • Updated 3 days ago • 91

upvoted a collection 5 days ago

UltraVox Audio Language Model Release 🔊

3 items • Updated 6 days ago • 15

upvoted a paper 6 days ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3 • 48

upvoted a collection 10 days ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 3 days ago • 223

upvoted 2 collections 16 days ago

🍓 Ichigo v0.3

The experimental family designed to train LLMs to understand sound natively. • 6 items • Updated 10 days ago • 17

llama.vim

Recommended models for the llama.vim plugin • 3 items • Updated 3 days ago • 3

upvoted an article 17 days ago

Article

Recipe: Preparing Multilingual Speech Datasets for TTS Training

By

•

17 days ago

• 14

upvoted a collection 17 days ago

OuteTTS

2 items • Updated 17 days ago • 10

upvoted a collection 19 days ago

AMD-OLMo

AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. • 4 items • Updated 21 days ago • 16

upvoted a paper 20 days ago

Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis

Paper • 2410.23320 • Published 22 days ago • 6

upvoted a collection 21 days ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 14 days ago • 95

upvoted a paper 23 days ago

GPT-4o System Card

Paper • 2410.21276 • Published 27 days ago • 79

upvoted a collection 25 days ago

LongVU

7 items • Updated 21 days ago • 26

upvoted 2 collections about 1 month ago

LayerSkip

Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated about 1 hour ago • 43

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 139

upvoted a paper about 1 month ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Paper • 2410.06885 • Published Oct 9 • 40

upvoted an article about 1 month ago

Article

Improving Parquet Dedupe on Hugging Face Hub

Oct 5

• 30