vikas (Vikas Kumar)

upvoted an article about 1 month ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22

• 81

upvoted 2 papers about 2 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8 • 152

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 62

upvoted an article 2 months ago

Article

Finetuning PaliGemma with AutoTrain

Jul 25

• 7

upvoted a collection 2 months ago

Gemma 2 2B Release

Collection

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 31 • 76

upvoted 2 articles 2 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29

• 206

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 197

upvoted a collection 3 months ago

🪐 SmolLM

Collection

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 174

upvoted 3 articles 3 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 244

Article

The Rise of Agentic Data Generation

Jul 15

• 74

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5

• 107

upvoted a collection 3 months ago

Florence

Collection

9 items • Updated Jul 11 • 154

upvoted 2 articles 3 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24

• 168

Article

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡

Jul 9

• 35

upvoted an article 4 months ago

Article

Putting RL back in RLHF

Jun 12

• 60

upvoted a paper 5 months ago

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3 • 98

upvoted an article 5 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 161

upvoted a collection 5 months ago

InternVL 1.0

Collection

Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks • 16 items • Updated Jun 27 • 15

upvoted 3 articles 5 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 273

Article

How to Finetune phi-3 on MacBook Pro

Apr 24

• 62

Article

Vision Language Models Explained

Apr 11

• 183

upvoted a collection 5 months ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 13 days ago • 470

upvoted 3 collections 6 months ago

upvoted a collection 9 months ago

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 103

upvoted 2 papers 9 months ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2 • 64

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8 • 157

Vikas Kumar

AI & ML interests

Organizations

vikas's activity


[lecture artifacts] aligning open language models

PDF Document / OCR Datasets

LLaVa-NeXT
