Bram Vanroy's picture

Bram Vanroy PRO

BramVanroy

·

https://bramvanroy.github.io/

AI & ML interests

Artificial intelligence, natural language processing, computational linguistics

Recent Activity

New activity 1 day ago

HPLT/hplt_bert_base_fr

liked a model 2 days ago

New activity 7 days ago

ivdnt/galahad-corpus-data

Organizations

Posts 11

Post

1537

The InstructGPT paper mentions that they insert 10% pretraining data during SFT, which they find improves the effect of PPO (IIUC). Has anyone else done later ablations on this? I've only seen the inverse suggested, mixing in SFT data during pretraining.

Post

2233

All my models seem to be plagued by infinite lists. When you ask a question that requires it to write a list, it most often keeps adding bullet points or enumeration. I am wondering whether this is a result of using chatty GPT-4 as DPO preferences. Any thoughts?

Collections 7

Papers 1

arxiv:2312.12852

spaces 5

Running on Zero

Fietje

An efficient, open LMM for Dutch

Text To AMR

Dutch Simplification

Open Dutch LLM Leaderboard

MATEO

models 38

BramVanroy/fietje-2

Text Generation • Updated 23 days ago • 686 • 6

BramVanroy/fietje-2-instruct

Text Generation • Updated 23 days ago • 1.03k • 2

BramVanroy/fietje-2-chat

Text Generation • Updated 23 days ago • 2.58k • 1

BramVanroy/GEITje-7B-ultra

Text Generation • Updated 23 days ago • 789 • 37

BramVanroy/GEITje-7B-ultra-GGUF

Updated Sep 5 • 442 • 6

BramVanroy/fietje-2-chat-gguf

Updated Aug 27 • 101 • 4

BramVanroy/fietje-2-instruct-gguf

Updated Aug 27 • 76 • 2

BramVanroy/fietje-2-gguf

Updated Aug 27 • 100 • 1

BramVanroy/tweety-7b-dutch-v24a-GGUF

Updated May 9 • 56 • 1

BramVanroy/fietje-3-mini-4k-instruct-GGUF

Updated May 5 • 63 • 2

datasets 26

BramVanroy/lmsys-20240814-nl

Viewer • Updated about 1 month ago • 2.75k • 15

BramVanroy/en-to-la-instruct

Viewer • Updated Aug 23 • 52

BramVanroy/stack_md_lid

Viewer • Updated Aug 22 • 21M • 394 • 4

BramVanroy/Openhermes-2.5-dutch-46k-format

Viewer • Updated Aug 21 • 43.7k • 59

BramVanroy/fietje-2-data

Viewer • Updated Jun 4 • 13.8M • 78

BramVanroy/occiglot-fineweb-v0.5-nl

Viewer • Updated Jun 3 • 16.1M • 113 • 1

BramVanroy/no_robots_dutch

Viewer • Updated Jun 1 • 8.61k • 80 • 2

BramVanroy/ultra_feedback_dutch_cleaned

Viewer • Updated May 13 • 183k • 259 • 3

BramVanroy/WildChat-1M-filtered-gpt-4

Viewer • Updated May 4 • 139k • 52

BramVanroy/orca_dpo_pairs_dutch_cleaned

Viewer • Updated Apr 24 • 31.6k • 61 • 2