DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Paper • 2406.11931 • Published Jun 17 • 56
mDPO: Conditional Preference Optimization for Multimodal Large Language Models Paper • 2406.11839 • Published Jun 17 • 36
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Paper • 2406.06525 • Published Jun 10 • 64
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Paper • 2406.04333 • Published Jun 6 • 36
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models Paper • 2405.20541 • Published May 30 • 20
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality Paper • 2405.21060 • Published May 31 • 63
LLMs achieve adult human performance on higher-order theory of mind tasks Paper • 2405.18870 • Published May 29 • 16
🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets Article • By dvilasuero • Published Jun 4 • 69
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published Apr 29 • 118
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12 • 60
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2 • 114