Maxime Labonne's picture

Maxime Labonne PRO

mlabonne

·

https://mlabonne.github.io/blog

AI & ML interests

Post-training, model editing, quantization

Articles

Decoding Strategies in Large Language Models

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

The Rise of Agentic Data Generation

Uncensor any LLM with abliteration

Fine-tune Llama 3 with ORPO

Create Mixtures of Experts with MergeKit

Merge Large Language Models with mergekit

Organizations

Posts 5

Post

15630

Large models are surprisingly bad storytellers.

I asked 8 LLMs to "Tell me a bedtime story about bears and waffles."

Claude 3.5 Sonnet and GPT-4o gave me the worst stories: no conflict, no moral, zero creativity.

In contrast, smaller models were quite creative and wrote stories involving talking waffle trees and bears ostracized for their love of waffles.

Here you can see a comparison between Claude 3.5 Sonnet and NeuralDaredevil-8B-abliterated. They both start with a family of bears but quickly diverge in terms of personality, conflict, etc.

I mapped it to the hero's journey to have some kind of framework. Prompt engineering can definitely help here, but it's still disappointing that the larger models don't create better stories right off the bat.

Do you know why smaller models outperform the frontier models here?

Post

16115

✂️ Uncensor any LLM with abliteration

I wrote an article about abliteration and how NeuralDaredevil-8B was created. Beyond removing alignment, I believe it's an interesting technique with a lot of potential. It's basically fine-tuning without retraining.

In this article, we see how it works, implement it in Google Colab, and heal the abliterated model to recover the performance drop due to this technique. The final model is an uncensored and high-quality model with the highest MMLU score on the Open LLM Leaderboard (8B category).

https://huggingface.co/blog/mlabonne/abliteration

Collections 10

Papers 7

arxiv:2410.08371

arxiv:2305.15016

arxiv:2304.01238

arxiv:2112.02417

spaces 19

Model Family Tree

Yet Another LLM Leaderboard

AutoMerger

TwinLlama-3.1-8B-DPO

Running on Zero

TwinLlama-3.1-8B

FineLlama-3.1-8B

models 145

mlabonne/dummy-llama-2

Text Generation • Updated 5 days ago • 1.19k • 8

mlabonne/Hermes-3-Llama-3.1-70B-lorablated

Text Generation • Updated Oct 16 • 357 • 22

mlabonne/TwinLlama-3.1-8B-DPO-GGUF

Updated Oct 6 • 17 • 2

mlabonne/TwinLlama-3.1-8B-DPO

Text Generation • Updated Oct 6 • 80 • 3

mlabonne/TwinLlama-3.1-8B-GGUF

Updated Oct 6 • 29 • 2

mlabonne/TwinLlama-3.1-8B

Text Generation • Updated Oct 6 • 403 • 7

mlabonne/UltraLlama-3.1-8B

mlabonne/BigQwen2.5-Echo-47B-Instruct

Text Generation • Updated Sep 29 • 14 • 3

mlabonne/BigQwen2.5-52B-Instruct

Text Generation • Updated Sep 29 • 34 • 2

mlabonne/BigQwen2.5-125B-Instruct

Text Generation • Updated Sep 25 • 38 • 10

datasets 47

mlabonne/orca-agentinstruct-1M-v1-cleaned

Viewer • Updated 3 days ago • 1.05M • 254 • 36

mlabonne/open-perfectblend

Viewer • Updated Oct 18 • 1.42M • 885 • 44

mlabonne/lmsys-arena-human-preference-55k-sharegpt

Viewer • Updated Oct 18 • 57.4k • 109 • 4

mlabonne/orpo-dpo-mix-40k

Viewer • Updated Oct 17 • 44.2k • 1.62k • 248

mlabonne/ultrachat_200k_sft

Viewer • Updated Oct 13 • 208k • 441 • 1

mlabonne/orca-math-word-problems-80k

Viewer • Updated Sep 23 • 80k • 54 • 4

mlabonne/llmtwin-dpo

Viewer • Updated Aug 30 • 1.63k • 57 • 1

mlabonne/llmtwin

Viewer • Updated Aug 27 • 3.34k • 87 • 7

mlabonne/lmsys-arena-human-preference-filtered-19k

Viewer • Updated Aug 15 • 57.5k • 58 • 4

mlabonne/FineTome-Alpaca-100k

Viewer • Updated Aug 2 • 100k • 74 • 1