Maxime Labonne's picture

Maxime Labonne PRO

mlabonne

·

https://mlabonne.github.io/blog

AI & ML interests

Post-training, model editing, quantization

Recent Activity

New activity about 1 hour ago

mlabonne/orca-agentinstruct-1M-v1-cleaned

liked a dataset about 1 hour ago

HuggingFaceTB/smoltalk

Articles

Decoding Strategies in Large Language Models

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

The Rise of Agentic Data Generation

Uncensor any LLM with abliteration

Fine-tune Llama 3 with ORPO

Create Mixtures of Experts with MergeKit

Merge Large Language Models with mergekit

Organizations

mlabonne's activity

upvoted an article 3 days ago

Article

The Beginners Guide to Cleaning a Dataset

By

•

3 days ago

• 21

upvoted an article 8 days ago

Article

Releasing the largest multilingual open pretraining dataset

By

•

8 days ago

• 94

upvoted an article 23 days ago

Article

Decoding Strategies in Large Language Models

By

•

23 days ago

• 38

upvoted a paper 25 days ago

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22 • 11

upvoted an article about 2 months ago

Article

VLM Art Analysis

By

•

Oct 4

• 11

upvoted an article 2 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18

• 202

upvoted a collection 3 months ago

🧠 Abliteration

Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 7 items • Updated 3 days ago • 22

upvoted an article 3 months ago

Article

Introduction to ggml

Aug 13

• 113

upvoted a paper 4 months ago

The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines

Paper • 2408.01050 • Published Aug 2 • 8

upvoted an article 4 months ago

Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

By

•

Aug 4

• 26

upvoted a paper 4 months ago

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1 • 22

upvoted a collection 4 months ago

Probably function calling datasets

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 36

upvoted 2 papers 4 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1 • 27

Understanding Reference Policies in Direct Preference Optimization

Paper • 2407.13709 • Published Jul 18 • 16

upvoted 3 collections 4 months ago

Bad Data Toolbox

PleIAs collection of models for the data processing of challenging document and data sources. • 5 items • Updated Jul 18 • 11

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated 15 days ago • 117

Finance Commons

A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17 • 4

upvoted an article 4 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29

• 244

upvoted a paper 4 months ago

The Importance of Online Data: Understanding Preference Fine-tuning via Coverage

Paper • 2406.01462 • Published Jun 3 • 6

upvoted an article 4 months ago

Article

The Rise of Agentic Data Generation

By

•

Jul 15

• 78